Utility indifference pricing and hedging for structured contracts in energy markets

Callegaro, Giorgia; Campi, Luciano; Giusto, Valeria; Vargiolu, Tiziano

doi:10.1007/s00186-016-0569-6

Utility indifference pricing and hedging for structured contracts in energy markets

Open access
Published: 04 February 2017

Volume 85, pages 265–303, (2017)
Cite this article

Download PDF

You have full access to this open access article

Mathematical Methods of Operations Research Aims and scope Submit manuscript

Utility indifference pricing and hedging for structured contracts in energy markets

Download PDF

Giorgia Callegaro¹,
Luciano Campi²,
Valeria Giusto³ &
…
Tiziano Vargiolu¹

2542 Accesses
6 Citations
Explore all metrics

Abstract

In this paper we study the pricing and hedging of structured products in energy markets, such as swing and virtual gas storage, using the exponential utility indifference pricing approach in a general incomplete multivariate market model driven by finitely many stochastic factors. The buyer of such contracts is allowed to trade in the forward market in order to hedge the risk of his position. We fully characterize the buyer’s utility indifference price of a given product in terms of continuous viscosity solutions of suitable nonlinear PDEs. This gives a way to identify reasonable candidates for the optimal exercise strategy for the structured product as well as for the corresponding hedging strategy. Moreover, in a model with two correlated assets, one traded and one nontraded, we obtain a representation of the price as the value function of an auxiliary simpler optimization problem under a risk neutral probability, that can be viewed as a perturbation of the minimal entropy martingale measure. Finally, numerical results are provided.

Duality in optimal consumption–investment problems with alternative data

Article Open access 14 June 2024

A stochastic Asset Liability Management model for life insurance companies

Article Open access 05 May 2022

A Multiscale study of flexible customer’s energy demand under smart grid architecture: A modeling and simulation study

Article 10 June 2024

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

In the last 15 years, since the start of the energy market deregulation and privatization in Europe and in the US, the study of energy markets became a challenging topic from both a practical and a theoretical perspective. Especially important is the problem of pricing and hedging of energy contracts. This is far from being trivial because of the peculiarity of the models and since these contracts typically have a very complex structure, incorporating optionality features which can be exercised by the buyer at multiple times. The two main examples of products used in energy markets for primary supply are swing contracts and forward contracts. While the structure of the forward contracts is rather simple, swing contracts are much more involved as they give the buyer some degrees of freedom about the quantity of energy to buy for each sub-period, usually with daily or monthly scale, subject to a cumulated constraint over the contract period. This flexibility is much welcomed by the contract buyers, as energy markets are affected by many unexpected events such as peaks in consumption related to sudden weather changes, breakdowns of power plants, financial turmoils and so on. Many other kinds of contract are traded in the energy market, they are often negotiated over-the-counter, and some of them, e.g. virtual storage contracts, also include optionality components similar to the ones of swing contracts.

The pricing of these products has a consolidated tradition in discrete time models (see, e.g. Edoli et al. 2013 or Henaff et al. 2013 and references therein), which is manly based on dynamic programming. The article (Jaillet et al. 2004) use a multilevel lattice method to study the pricing of a swing option on natural gas. The two papers (Pagès et al. 2009, 2010) propose a different method based on optimal quantization theory. In continuous time models, the first approaches were based on optimal switching techniques (e.g., Carmona and Ludkovski 2010) or multiple stopping (e.g., Carmona and Touzi 2008). In all these articles the optionality features of the structured product can be exercised over a discrete set of stopping times, that can be chosen by the buyer. A very detailed comparison of the literature on storage and swing evaluation can be found in Aïd (2015), Table 4.1.

A different approach consists in approximating the contract payoff with its continuous time counterpart. This idea has been proposed in Benth et al. (2012) (and further exploited in Basei et al. (2014)) for swing contracts and in Chen and Forsyth (2007), Felix (2012), Thompson et al. (2009) for virtual storage contracts. Other examples of structured contracts can be treated with the same methodology, see e.g. Benth and Eriksson (2013) for flexible load contracts and tolling agreements. The main advantage of this approach is that it makes the pricing problem more tractable, since it allows using the stochastic control theory in continuous time, based on PDE methods. In those papers, the price of a structured contract is defined—in analogy with American options—as the supremum, over all the strategies available to the buyer, of the expected payoff, where the expectation is taken under a given risk-neutral measure. When the model for the underlying of the structured contract is Markovian, as it happens in most of the models used in practice, the pricing problem reduces to solving the corresponding Hamilton–Jacobi–Bellman (HJB) equation. Notice that the choice of the risk-neutral measure is not at all obvious since energy market models are typically incomplete, because of the presence of assets bearing non tradable risks. Moreover, with the exception of Warin (2012) which focuses on gas storage contracts and uses a delta-hedging approach, the problem of hedging the risk coming from a long position in those structured products is not considered in those papers. Hedging such contracts can be quite a delicate task in energy markets, since the underlying of the contracts is often not tradable, hence the buyer has to trade in some other asset with a good correlation with the underlying. For an extensive review of the existing literature with descriptions of the most traded contracts and a detailed comparison between the main articles we refer once more to the recent book (Aïd 2015, Chapter 4).

Our contribution to the literature consists in building on the idea of continuously approximating the payoff as in Basei et al. (2014), Benth et al. (2012), in order to provide a general framework, where both problems of pricing and hedging of structured contracts can be solved in a consistent fashion. The main novelty of this paper is that the buyer of the given structured contract is allowed to (at least partially) hedge his position by trading in forward contracts, written on the underlying of the structured contract itself or on some asset correlated with the underlying. We model the forward market as a general incomplete multivariate market model with finitely many forward contracts (with different maturities), evolving over time as diffusions whose coefficients depend on a certain number of exogenous stochastic factors with Markovian dynamics. The underlying of the structured contract is defined as a function of such factors. This setting includes many models that have been previously proposed and studied in the literature, e.g. Aïd et al. (2014), Carmona and Ludkovski (2006), Cartea and Villaplana (2008), Schwartz and Smith (2000) to cite a few.

The market being incomplete, we adopt the utility indifference price (henceforth UIP) approach, which is one of the most appealing ways of pricing in incomplete markets, since it naturally incorporates the buyer preferences in the price of the contract. We assume that the preferences of the buyer can be encoded in an exponential utility function with a risk aversion parameter $\gamma >0$. The UIP approach has been extensively used for pricing European and American options in a wide range of financial market models. We refer to Henderson and Hobson (2009) for an excellent survey on this approach. This approach was already used in Porchet et al. (2009), Ludkovski (2008) for the evaluation of industrial assets (see Remark 2.10) and in Fiorenzani (2006) and Benedetti and Campi (2016) for energy derivatives.

We apply this method for evaluating a rather general structured derivative. Its buying UIP will be characterized as the difference between the two log-value functions of the agent (with and without the contract), that can be obtained as the unique viscosity solutions of a suitable HJB equation. Our results are consistent with the ones in Basei et al. (2014), Benth et al. (2012), Chen and Forsyth (2007), Felix (2012), Thompson et al. (2009), in the case of complete market models. Moreover, the shape of such HJB equation gives reasonable candidates for the optimal withdrawal strategy of the structured product, as well as for the related hedging strategy.

Finally, we push our general results further in two specific examples. One of them is a class of models with two risky assets, one traded and one nontraded, and constant correlation. This includes models with a nontraded asset and basis risk, which have been studied by many authors (see, e.g., the papers Davis 2006, Henderson 2002, Monoyios 2004 to cite only a few). For these models, we obtain a representation of the price as the value function of an auxiliary simpler optimization problem under a risk-neutral probability, that can be viewed as a perturbation of the minimal entropy martingale measure. Such a perturbation is due to the dependence of drift and volatility of the traded asset on the nontraded one and depends on the value function without the contract. It seems that such a measure change is new to the incomplete market literature. The second example is based on a slight generalization of the two-factor model developed in Cartea and Villaplana (2008) for energy markets, where the factors can be correlated.

The paper is organized as follows. In Sect. 2 we formulate the problem of pricing, by introducing the general payoff of the structured contracts, the market model and the (exponential) utility indifference price. In Sect. 3, we characterize the UIP in terms of viscosity solutions of suitable HJB equations. In Sect. 4 we consider the two examples described above while Sect. 5 presents some numerical applications of our results. Finally, Sect. 6 concludes.

Notation In what follows, unless explicitly stated, vectors will be column vectors, the symbol “*” will denote transposition and the trace of a square matrix A will be denoted by $\text {tr}(A)$. Furthermore, $\langle a, b \rangle := a^* b$ will stand for the Euclidean scalar product. We choose as matricial norm $| A | = \sqrt{\mathrm{{tr}}(AA^*)}$. On the set $\mathcal S_n$ of all symmetric squared matrices of order n, we define the order $A\le B$ if and only if $B-A \in \mathcal S_n ^+$, the subspace of nonnegative definite matrices in $\mathcal S_n$. We will denote by $I_n$ the identity matrix of dimension n.

2 Formulation of the problem

Let $T>0$ be a finite time horizon. All the processes introduced below will be defined on the canonical probability space $(\Omega , \mathcal F, {\mathbb {P}})$, where $\Omega := C([0,T];{\mathbb {R}}^d)$ is the space of all continuous functions from [0, T] into $\mathbb R^d$. For $\omega \in \Omega $, we set $W_t (\omega ) = \omega (t)$ and define $(\mathcal F_t)_{t\in [0,T]}$ as the smallest right-continuous filtration such that W is optional. Moreover, $\mathcal F := \mathcal F_T$. We let ${\mathbb {P}}$ be the Wiener measure on $(\Omega , {\mathbb {F}}_T)$. We can assume without loss of generality that such a filtration is complete.

2.1 Structured products

In this section we give a short description of the main structured products that are traded in energy markets. The typical payoff is given by a family of random variables

$$\begin{aligned} C_T^u :=\int _0^T L(P_s,Z_s^u,u_s) ds + \Phi (P_T,Z_T^u), \end{aligned}$$

(2.1)

indexed by a control u, which typically represents the marginal quantity of commodity purchased and it belongs to a suitable set of admissible controls $\mathcal U$ that we will specify later. In particular, the admissible controls take values in some bounded interval $[0,{\bar{u}}]$ for a given threshold ${\bar{u}} >0$. The variable P in the Eq. (2.1) above denotes the spot price of the commodity (e.g., gas) and $Z^u _t := z_0 + \int _0^t u_s ds$ for all $t \in [0,T]$, for some initial value $z_0 \ge 0$. For technical reasons, that will become clear in the proofs of our results, we will need the following assumption on the structured products.

Assumption 2.1

The functions $L: {\mathbb {R}} \times [0,{\bar{u}} T] \times [0,{\bar{u}}] \rightarrow {\mathbb {R}}$ and $\Phi : {\mathbb {R}} \times [0,{\bar{u}} T] \rightarrow {\mathbb {R}}$ in (2.1) are continuous and bounded.

The most common structured products in energy markets are swing and virtual storage contracts. More details are given just below. See also the subsequent Remark 2.4 explaining how one can safely modify these contracts in order to satisfy Assumption 2.1.

Example 2.2

(Swing contract) For a swing contract one has (see, e.g., Basei et al. 2014; Benth et al. 2012)

$$\begin{aligned} L(p,z,u) = u (p - K), \end{aligned}$$

where K is the purchase price or strike price, and u is any admissible control. These products usually include some additional features, such as inter-temporal constraints on u or on the cumulated control $Z^u$ or some penalty function appearing in the payoff. More precisely, constraints on u and $Z^u$ are typically of the form $Z_T^u \in [m,M]$, with $0 \le m < M$, with possibly further intermediate constraints on $Z_{t_i}^u$, $t_i < T$, $i = 1,\ldots ,k$. In the absence of such additional constraints, a penalty is usually present which can be expressed as a function $\Phi $ of the terminal spot price $P_T$ and cumulated consumption $Z^u_T$. A typical form of $\Phi $ is

$$\begin{aligned} \Phi (p,z) = - C \left( (m - z)^+ + (z - M)^+\right) \end{aligned}$$

(2.2)

for constants $C>0$ and $0 \le m < M$ (see Basei et al. 2014; Benth et al. 2012 and references therein). We will focus on the latter case, i.e., a non-zero penalty function $\Phi (P_T ,Z_T^u)$ without any other contraints on the admissible controls.

Example 2.3

(Virtual storage contract) These products replicate a physical gas storage position, while being handled as pure trading contracts. In this case one has

$$\begin{aligned} L(p,z,u) = - p (u - a(z,u)), \quad \Phi (p,z) = - C (M - z), \end{aligned}$$

with $C,M > 0$ suitable constants, $a(z,u):= \bar{a} \mathbbm {1}_{u<0}$ and where the control u represents the gas injected into the reservoir and is such that

$$\begin{aligned} u_t \in \left[ u_{\mathrm {in}}\left( Z_t^u\right) ,u_{\mathrm {out}}\left( Z_t^u\right) \right] ,\quad t\in [0,T], \end{aligned}$$

where $u_{\mathrm {in}},u_{\mathrm {out}}$ are suitable deterministic functions given by the physics of fluids: their typical shapes are

$$\begin{aligned} u_{\mathrm {in}}(z):= - K_1 \sqrt{z} , \quad u_{\mathrm {out}}(z):= K_2 \sqrt{\frac{1}{z + Z_b} - K_3} \end{aligned}$$

with $Z_b,K_i > 0$, $i = 1,2,3$, given constants (Chen and Forsyth 2007; Felix 2012; Thompson et al. 2009).

Remark 2.4

The boundedness of L as in Assumption 2.1 is not verified in the two Examples 2.2 and 2.3, where L is linear in p, which can in principle take any real value. In practice, one can artificially bound L, for example by introducing

$$\begin{aligned} {\tilde{L}}(p,z,u):= \max (- \kappa , \min (L(p,z,u),\kappa )), \end{aligned}$$

so that $|{\tilde{L}}(p,z,u)| \le \kappa $ for all (p, z, u), for a suitably chosen and large enough threshold $\kappa >0$ such that the instantaneous profit should not be larger than $\kappa $ in absolute value with high probability. The same truncation argument can be applied to the penalty function $\Phi (p,z)$. Alternatively, one could truncate the unbounded variable p appearing in both payoffs L and $\Phi $.

2.2 The market model

The spot price of the commodity, P, underlying the structured products, is modelled as $P_t:= p(t,X_t)$, where $p:[0,T] \times {\mathbb {R}}^m \rightarrow {\mathbb {R}}$ is a measurable function and X represents the factors driving the market. We assume that the process X has Markovian dynamics

$$\begin{aligned} dX_t = b(t,X_t)\ dt + \Sigma ^*(t,X_t)\ dW_t, \quad X_0 =x \in {\mathbb {R}}^m, \end{aligned}$$

(2.3)

with drift $b:[0,T] \times {\mathbb {R}}^m \rightarrow {\mathbb {R}}^m$ and volatility matrix $\Sigma :[0,T] \times {\mathbb {R}}^m \rightarrow {\mathbb {R}}^{d \times m}$.

We also assume that $n\le d$ forward contracts on the commodity P are traded in the market, with maturities $T_1< \cdots < T_n$, with $T_1 \ge T$. Letting $F^i$ to denote the price of the forward contract with maturity $T_i$, $i = 1,\ldots ,n$, we assume that the dynamics of $F:= (F^1,\ldots ,F^n)$ is given by

$$\begin{aligned} dF_t = \mathrm {diag}(F_t) \left( \mu _F (t,X_t) dt + \sigma _F ^* (t,X_t) dW_t \right) , \quad F_0 = f_0 \in {\mathbb {R}}^n, \end{aligned}$$

(2.4)

for some functions $\mu _F: [0,T]\times {\mathbb {R}}^m \rightarrow {\mathbb {R}}^n$ and $\sigma _F: [0,T]\times {\mathbb {R}}^m \rightarrow {\mathbb {R}}^{d\times n}$.

Assumptions on the coefficients of X and F are given below. We will always assume throughout the paper that the interest rate is zero.

Notice that the forward contracts are not necessarily written on the commodity with spot price P, as they could also be written on some correlated commodity. For instance, P could be the spot price of gasoline, while the F’s are written on oil, as in Carmona and Ludkovski (2006), Fiorenzani (2006). This can be also due to illiquidity or to the fact that forward contracts relative to the commodity do not exist: for a detailed discussion of this phenomenon, see Carmona and Ludkovski (2006), Sect. 2.3.

We will always work under the following standing assumptions on the coefficients of the model:

Assumption 2.5

(i)
The function $p{:}\, [0,T] \times {\mathbb {R}}^m \rightarrow \mathbb R$ is continuous.

(ii)
The coefficients $b{:}\,[0,T] \times {\mathbb {R}}^m \rightarrow {\mathbb {R}}^m$ and $\Sigma :[0,T] \times {\mathbb {R}}^m \rightarrow {\mathbb {R}}^{d \times m}$ of the factor process X are continuous functions, Lipschitz in x uniformly in t and with linear growth in x uniformly in t.
(iii)
The drift $\mu _F{:}\, [0,T]\times {\mathbb {R}}^m \rightarrow {\mathbb {R}}^n$ and the volatility $\sigma _F: [0,T]\times {\mathbb {R}}^m \rightarrow {\mathbb {R}}^{d\times n}$ are continuous functions, Lipschitz in x uniformly in t and with linear growth in x uniformly in t.

Under such assumptions, the SDEs (2.3) and (2.4) are well-known to admit a unique strong solution (X, F) such that $X_0 =x$ and $F_0 = f_0$ [see, e.g., Theorem 13.1 in Rogers and Williams (2000), Chapter V].

2.3 Admissible strategies and utility indifference price

We consider an agent whose preferences are modelled by an exponential utility function $U(x)=-\frac{1}{\gamma }e^{-\gamma x}$ , $x \in {\mathbb {R}}$, with risk aversion parameter $\gamma > 0$. We assume that (s)he has a long position in $q \ge 0$ units of a given structured product with payoff $C_T = (C_T ^u)_{u\in \mathcal U}$ with $C_T ^u $ as in (2.1). Moreover, in order to hedge away the risk attached to such a contract, (s)he trades in the financial market of forward contracts described in the previous section. At any time $s \in [0,T]$, the agent invests the amount of wealth $\pi ^i_s$ in the forward contracts $F^i$ with $i = 1,\ldots ,n$. Hence the evolution of the agent’s portfolio is

$$\begin{aligned} \left\langle \pi _s, \frac{dF_s}{F_s} \right\rangle = \sum _{i=1}^n \pi ^i_s \frac{dF^i_s}{F^i_s} = \langle \pi _s, \mu _F (s,X_s) ds + \sigma _F^* (s,X_s) dW_s \rangle , \end{aligned}$$

where we recall that $\langle \cdot , \cdot \rangle $ denotes the Euclidean scalar product in $\mathbb {R}^n$ and we use the notation

$$\begin{aligned} \frac{dF_s}{F_s}:= \left( \frac{dF^i_s}{F^i_s}\right) _{i=1,\ldots ,n} = \mu _F (s,X_s) ds + \sigma _F^* (s,X_s) dW_s , \quad s\in [0,T] . \end{aligned}$$

At this point, we need to specify the set $\mathcal A$ of admissible strategies.

Definition 2.6

Let ${\bar{u}} >0$ be a given threshold. The set of admissible controls $\mathcal A$ is the set of all couples $(u,\pi )$, where u is any adapted process such that $u_t \in [0,{\bar{u}}]$ for all $t\in [0,T]$, and $\pi $ is any progressively measurable ${\mathbb {R}}^n$-valued process such that

$$\begin{aligned} \sup _{t\in [0,T]} {\mathbb {E}}\left[ \exp \left( \varepsilon | \pi _t |^2 \right) \right] < \infty , \end{aligned}$$

(2.5)

for some $\varepsilon >0$. We will denote by $\mathcal U$ the set of all admissible controls u. Moreover, $\mathcal A_t$ (resp. $\mathcal U_t$) will be the set of admissible controls $(u,\pi )$ (resp. admissible controls u) starting from t.

Now, we are ready to introduce the utility indifference (buying) price of q units of the structured product $C_T$. We will use the notation $C_{t,T}^u$ for the payoff of the structured contract $C_T^u$ starting at time t, i.e.,

$$\begin{aligned} C_{t,T}^u = \int _t^T L(P_s,Z_s^u,u_s) ds + \Phi (P_T,Z_T^u), \quad t\in [0,T]. \end{aligned}$$

Moreover, we set $C_T^u = C_{0,T}^u$.

Definition 2.7

The utility indifference (buying) price at time t for a position $q \ge 0$ in the structured product, when starting from the initial portfolio value $y_t$, is defined as the unique $\mathcal F_t$-measurable random variable $v_t $ solution (whenever it exists) to

$$\begin{aligned} V(y_t - v_t,q) = V(y_t,0), \end{aligned}$$

(2.6)

where

$$\begin{aligned} V(y_t,q):= \sup _{(u,\pi ) \in \mathcal A_t} {\mathbb {E}}_t \left[ -\frac{1}{\gamma }\exp \left( - \gamma \left( y_t + \int _t^T \left\langle \pi _s, \frac{d F_s}{F_s} \right\rangle + q C_{t,T}^u \right) \right) \right] , \end{aligned}$$

(2.7)

and ${\mathbb {E}}_t$ stands for the conditional expectation given $\mathcal F_t$.

Clearly, $V(y_0,q)$ gives the maximal expected utility from terminal wealth, computed at time 0, that an agent with an exponential utility can obtain starting from an initial wealth $y_0$ and having a position $q \ge 0$ in the structured product. Therefore, the (buying) UIP defined above represents the highest price the buyer is willing to pay for q units of the structured contract.

The maximization problem (2.7) can be easily translated into a standard Markovian control problem by suitably redefining the set of state variables as follows. Let $t \in [0,T]$. First, using Equation (2.1), we can rewrite the terminal wealth as follows

$$\begin{aligned} y_t + \int _t^T \left\langle \pi _s, \frac{d F_s}{F_s} \right\rangle + q C_{t,T}^u= & {} y_t +\int _t^T \left\langle \pi _s, \frac{d F_s}{F_s} \right\rangle \\&+\, q \int _t^T L(P_s,Z_s^u,u_s) ds +q \Phi (P_T,Z_T^u). \end{aligned}$$

Using the fact that $P_t = p(t,X_t)$ is a function of the factor process X, we obtain that the value function in (2.7) equals

$$\begin{aligned} V(t,x,y,z ; q):= \sup _{(u,\pi ) \in \mathcal A_t} \mathbb E_{t,x,y,z} \left[ G \left( X_T, Y_T^{u,\pi },Z_T^{u} ;q\right) \right] , \end{aligned}$$

(2.8)

where the reward function G is given by

$$\begin{aligned} G (x,y,z;q):= -\frac{1}{\gamma } e^{-\gamma (y + q \Phi (p(T,x),z))}, \end{aligned}$$

(2.9)

and the state variables $(X,Y^{u,\pi },Z^u)$ evolve as

$$\begin{aligned} \left\{ \begin{array}{lcl} d X_s &{}=&{} b(s,X_s) ds + \Sigma ^*(s,X_s) dW_s,\\ dY_s^{u,\pi } &{}=&{} \left( \langle \pi _s,\mu _F (s,X_s) \rangle + q L(p(s,X_s),Z_s^u,u_s) \right) ds + \langle \pi _s, \sigma _F^* (s,X_s) dW_s \rangle ,\\ d Z_s^u &{}=&{} u_s ds, \end{array} \right. \end{aligned}$$

(2.10)

with initial conditions $(X_t, Y^{u,\pi }_t,Z^u _t) = (x,y,z)$. Notice that the linear growth properties set in Assumption 2.5 combined with the boundedness of L in Assumption 2.1 give the following estimate for the controlled state process $(X,Y^{u,\pi },Z^u)$:

$$\begin{aligned} {\mathbb {E}}_{t,x,y,z} \left[ \sup _{t \le \tau \le T} \left| \left( X_\tau , Y^{u,\pi }_\tau , Z^u_\tau \right) \right| ^p \right] \le C_{u,\pi } (1+|(x,y,z)|^p), \quad t \in [0,T), \quad p\ge 1, \end{aligned}$$

(2.11)

for some constant $C_{u,\pi }>0$, which depends possibly on the control $(\pi , u)$ and is uniform in t.

Remark 2.8

Observe that the linear growth condition on b and $\Sigma $ [cf. Assumption 2.5(ii)] imply, through an application of Gronwall’s lemma, that

$$\begin{aligned} \sup _{t\in [0,T]} {\mathbb {E}}\left[ e^{\eta | X_t|^2} \right] < \infty , \end{aligned}$$

(2.12)

for some $\eta >0$.

Within this formulation, the UIP of $q \ge 0$ units of the structured product is the unique solution $v_t = v(t,x,y,z;q)$ (whenever it exists) to

$$\begin{aligned} V(t,x,y-v_t,z;q) = V(t,x,y,z;0). \end{aligned}$$

Remark 2.9

In principle, the controls associated to the virtual storage contract described in Example 2.3 do not satisfy Definition 2.6, where the control $u_t$ belongs to $[0,{\bar{u}}]$ with ${\bar{u}}$ constant. However, this example can be reduced to our setting by simply reparameterizing the control. In fact, one could define a new control c with values in [0, 2] such that the old control u satisfies $u_t = f(c_t,Z_t)$ for a suitable function f(c, z) given by

$$\begin{aligned} f(c,z):= \left\{ \begin{array}{ll} (c - 1) K_1 \sqrt{z}, &{} 0 \le c \le 1, \\ (c - 1) K_2 \sqrt{\frac{1}{z + Z_b} - K_3}, &{} 1 \le c \le 2, \\ \end{array} \right. \end{aligned}$$

Z solves

$$\begin{aligned} dZ_t = f(c_t,Z_t)\ dt , \quad Z_0 = z_0.\end{aligned}$$

and $L(p,z,c) = - p (f(c,z) - a(z,f(c,z)))$.

Remark 2.10

Here, we briefly discuss two papers, Ludkovski (2008) and Porchet et al. (2009), that do not fit (strictly speaking) the literature on structured products but that are somewhat related. Indeed, they both deal with the pricing of a physical/industrial asset using a UIP approach with an investment component. However, even though the optimization problems in Ludkovski (2008), Porchet et al. (2009) are mathematically similar to the one considered here, the controls affecting the asset are switching controls with finitely many states. Hence their methods, that are based on optimal switching and BSDEs, are different from ours. Finally, our model is more specific than theirs as it is tailor-made for the pricing and hedging of structured contracts on energy.

3 Characterization of the UIP with viscosity solutions

In this section we characterize, under some further technical assumptions given below, the UIP in terms of viscosity solutions of suitable Cauchy problems. More precisely, we prove that the log-value functions for problem (2.8) with zero initial wealth, defined as

$$\begin{aligned} J(t,x,z;q):= - \frac{1}{\gamma } \log \left( - V(t,x,0, z;q) \right) , \quad q \ge 0, \end{aligned}$$

(3.1)

can be characterized as the unique continuous viscosity solutions with quadratic growth to a suitable Cauchy problem. The UIP is obtained from there as the difference between the two log-value functions, corresponding to the problems with and without the structured products. This is done using some techniques developed in Pham (2002) together with recent results on uniqueness for a class of second order Bellman-Isaacs equations, established in Da Lio and Ley (2006).

3.1 Heuristics on the value function PDE

In this section we derive, in a heuristic way, the PDE that the value functions appearing in the definition of UIP are expected to satisfy. It is a classical property in the presence of an exponential utility function (see, e.g., the papers Becherer 2003, 2004; Becherer and Schweizer 2005; Henderson and Hobson 2009; Valdez and Vargiolu 2013) that one can factor out the initial wealth y so that

$$\begin{aligned} V(t,x,y,z;q) = e^{- \gamma y} V(t,x,0,z;q), \quad y \in {\mathbb {R}}. \end{aligned}$$

Hence, by definition of UIP, we deduce

$$\begin{aligned} e^{- \gamma (y-v)} V(t,x,0,z;q)= & {} V(t,x,y-v,z;q)\\= & {} V(t,x,y,z;0) = e^{- \gamma y} V(t,x,0,z;0) \end{aligned}$$

so that the UIP v is given by

$$\begin{aligned} v = - \frac{1}{\gamma } \log \frac{V(t,x,0,z;q)}{V(t,x,0,z;0)}= J(t,x,z;q) - J(t,x,z;0), \end{aligned}$$

(3.2)

where J denotes the log-value function defined in (3.1). From the general theory of stochastic optimal control with Markovian state variables it is clear that the value function V is expected to satisfy the following HJB equation

$$\begin{aligned} V_t(t,x,y,z;q) + \sup _{(u,\pi ) \in [0,{\bar{u}}]\times {\mathbb {R}}^n} \mathcal L^{u,\pi } V(t,x,y,z;q) = 0, \end{aligned}$$

(3.3)

with terminal condition $V(T,x,y,z ; q) = G(x,y,z;q)$ and where

$$\begin{aligned} \mathcal L^{u,\pi } V= & {} \left( \langle \pi ,\mu _F \rangle + q L \right) V_y + \langle b, V_x \rangle + u V_z + \frac{1}{2} |\pi ^{*} \sigma _F^*|^2 V_{yy}\\&+ \frac{1}{2} \mathrm {tr} \left( \Sigma ^* \Sigma V_{xx} \right) + \pi ^{*} \sigma _F^* \Sigma V_{xy} \end{aligned}$$

is the generator of the state variable (X, Y, Z). Recalling that $V(t,x,y,z;q) = - e^{- \gamma y - \gamma J(t,x,z;q)}$, we can easily deduce from (3.3) the following PDE for the log-value function $J:=J(t,x,y,z;q)$:

$$\begin{aligned} \begin{array}{l} \displaystyle J_t + \sup _{(u,\pi ) \in [0,{\bar{u}}]\times \mathbb R^n} \big [ \langle \pi , \mu _F \rangle + qL + \langle b, J_x \rangle + u J_z - \frac{1}{2} \gamma |\pi ^{*} \sigma _F^*|^2 \\ \quad -\, \frac{1}{2} \gamma | \Sigma J_x|^2 + \frac{1}{2} \text {tr} \left( \Sigma ^* \Sigma J_{xx} \right) - \gamma \pi ^{*} \sigma ^*_F \Sigma J_x \big ] = 0. \end{array} \end{aligned}$$

(3.4)

The Hamiltonian therein is maximised by the control ${\hat{\pi }^q}$, given by

$$\begin{aligned} {\hat{\pi }}^q = (\sigma _F^* \sigma _F)^{-1} \left( \frac{\mu _F}{\gamma } - \sigma _F^* \Sigma J_x \right) . \end{aligned}$$

(3.5)

Substituting ${\hat{\pi }}^q$ into the Eq. (3.4) leads to

$$\begin{aligned} \begin{array}{l} \displaystyle J_t + \frac{1}{2 \gamma } \langle (\sigma _F^* \sigma _F)^{-1} \mu _F,\mu _F\rangle + \langle {\bar{b}} , J_x \rangle + \sup _{u \in [0,{\bar{u}}]} \Big [ u J_z + qL \Big ] \\ \quad -\frac{1}{2} \gamma J_x^{*} B J_x + \frac{1}{2} \text {tr} \left( \Sigma ^* \Sigma J_{xx}\right) = 0, \end{array} \end{aligned}$$

(3.6)

where

$$\begin{aligned} {\bar{b}}:= b - \Sigma ^* \sigma _F (\sigma _F^* \sigma _F)^{-1} \mu _F \end{aligned}$$

(3.7)

and B is a $m \times m$ symmetric matrix given by

$$\begin{aligned} B:= \Sigma ^* \Sigma - (\sigma _F^* \Sigma )^* (\sigma _F^* \sigma _F)^{-1} (\sigma _F^* \Sigma ) = \Sigma ^* (I_d - \sigma _F (\sigma _F^* \sigma _F)^{-1} \sigma _F^* ) \Sigma . \end{aligned}$$

(3.8)

The terminal condition for V translates into

$$\begin{aligned} J(T,x,z;q) = \frac{\log \gamma }{\gamma } + q\Phi (p(T,x),z), \quad (x,z) \in {\mathbb {R}}^m \times [0, {\bar{u}} T]. \end{aligned}$$

(3.9)

Remark 3.1

In order to compute the UIP as in Eq. (2.6), we first calculate J(t, x, z; 0), which satisfies Eq. (3.6) with the terminal condition $J(T,x,z;0) = \frac{\log \gamma }{\gamma }$. It is a classical and intuitive result that, in this situation, J(t, x, z; 0) does not depend on z. Denoting J(t, x, z; 0) by $J^0 (t,x)$ for simplicity, we have that it fulfills

$$\begin{aligned} \begin{array}{c} \displaystyle J_t^0 + \frac{1}{2 \gamma } \langle (\sigma _F^* \sigma _F)^{-1} \mu _F,\mu _F\rangle + \langle {\bar{b}} , J^0_x \rangle - \frac{1}{2} \gamma J_x^{0,*} B J_x^0 + \frac{1}{2} \text {tr} \left( \Sigma ^* \Sigma J_{xx}^0\right) = 0. \end{array} \end{aligned}$$

(3.10)

Thus, subtracting Eq. (3.10) from Eq. (3.6) and using the fact that

$$\begin{aligned} - \frac{1}{2} \gamma J_x^{*} B J_x + \frac{1}{2} \gamma J_x^{0,*} B J_x^0 = - \frac{1}{2} \gamma v_x^* B v_x - \gamma J_x^{0,*} B v_x \end{aligned}$$

we obtain the following PDE for the UIP v:

$$\begin{aligned} \begin{array}{c} \displaystyle v_t + \langle {\bar{b}}, v_x \rangle + \sup _{u \in [0,{\bar{u}}]} \Big [ u v_z+ q L \Big ] + \frac{1}{2} \mathrm {tr} \left( \Sigma ^* \Sigma v_{xx}\right) \displaystyle - \frac{1}{2} \gamma v_x^{*} B v_x - \gamma J_x^{0,*} B v_x = 0, \end{array} \end{aligned}$$

(3.11)

with the terminal condition

$$\begin{aligned} v(T,x,z;q) = q \ \Phi (p(T,x),z). \end{aligned}$$

(3.12)

Notice that solving the HJB equation for the UIP v(t, x, z; q) above requires the knowledge of $J^0$, which is the log-value function of the optimal investment problem with no claim. This phenomenon is due to the presence of the non-tradable factors X in the dynamics of the forward contracts F and it has been observed in a somewhat different model in Becherer (2004), where the non-tradable factors follow a pure jump dynamics.

3.2 Existence and uniqueness results

In this section we show that the log-value function J is the unique continuous viscosity solution with quadratic growth of Eq. (3.6) with the terminal condition (3.9). From there, the UIP v is easily found via the equality (3.2). We will work under the following standing assumption. Recall that the matrix B has been defined in (3.8).

Assumption 3.2

The following properties hold:

(i)
b is $C^1$, B and $\Sigma ^* \sigma _F ( \sigma _F ^* \sigma _F)^{-1} \mu _F$ are $C^1$ and Lipschitz in x uniformly in t.
(ii)
$\mu _F$ is bounded and $\langle (\sigma _F ^* \sigma _F)^{-1} \mu _F , \mu _F\rangle $ is $C^1$ and Lipschitz in x uniformly in t.
(iii)
$\sigma _F^* \sigma _F$ is bounded and uniformly elliptic, i.e., for some $\epsilon > 0$,
$$\begin{aligned} (\sigma _F ^* \sigma _F)(t,x) \ge \epsilon I_n, \quad \text{ for } \text{ all } (t,x) \in [0,T] \times \mathbb {R}^m\text{. } \end{aligned}$$
(3.13)
(iv)
The matrix B is positive semidefinite and there exists a constant $\delta >0$ (uniform in t, x) such that
$$\begin{aligned} \frac{1}{\delta } | \xi |^2 \le \langle \xi , B \xi \rangle \le \delta | \xi |^2 \end{aligned}$$
(3.14)
for all vectors $\xi \in {\mathrm{Im}}(B)$, the image of B.

Some comments on these hypotheses are in order. All the assumptions above, with the exception of $C^1$-regularities and boundedness of $\mu _F$ (linear growth is actually sufficient) have to be imposed in order to apply the method and the results established in Da Lio and Ley (2006). In particular, condition (iv) on B is related to the coercivity hypothesis in Assumption A1 in Da Lio and Ley (2006), which has a crucial role in the proof of their comparison theorem. Such a property has to be verified on a case-by-case basis. Some examples where this assumption is verified are provided in Sect. 4.

The additional $C^1$-regularity assumptions as well as the boundedness of $\mu _F$ allow us to adapt results from Pham (2002) to get the quadratic growth condition of the log-value function $J^0$ of the investment problem with no claim. Furthermore, thanks to Assumption 2.1 on the structured contract, the latter property will be inherited by the log-value function, J, with the claim.

We are now ready to state the main result of the paper.

Theorem 3.3

Let Assumptions 2.1 and 2.5 hold. Under Assumption 3.2, the log-value function J, defined in (3.1), is the unique continuous viscosity solution with quadratic growth of the Cauchy problem (3.6) with terminal condition (3.9).

Before proving the theorem, we give a preliminary result showing that the value function V is a (possibly discontinuous) viscosity solution of a Hamilton–Jacobi–Bellman (HJB) equation in the interior of its domain. Its proof is postponed to the “Appendix”.

Proposition 3.4

Let Assumptions 2.1 and 2.5 hold. Under Assumption 3.2, the value function V in (2.8) is a (possibly discontinuous) viscosity solution of the HJB equation

$$\begin{aligned}&V_t(t,x,y,z;q) + \sup _{(u,\pi ) \in [0,{\bar{u}}]\times {\mathbb {R}}^n} \mathcal L^{u,\pi } V(t, x,y,z ;q) = 0, \nonumber \\&\qquad (t, x,y,z) \in [0,T) \times {\mathbb {R}}^m \times {\mathbb {R}} \times {\mathbb {R}} \end{aligned}$$

(3.15)

with terminal condition $V(T, x,y,z;q) = G(x,y,z ;q )$, where

$$\begin{aligned} \mathcal L^{u,\pi } V= & {} \left( \langle \pi ,\mu _F \rangle + q L \right) V_y + \langle b, V_x \rangle + u V_z \\&\quad + \frac{1}{2} |\pi ^{*} \sigma _F^*|^2 V_{yy} + \frac{1}{2} \mathrm {tr} \left( \Sigma ^* \Sigma V_{xx} \right) + \pi ^{*} \sigma _F^* \Sigma V_{xy}. \end{aligned}$$

At this point we are in position to prove Theorem 3.3.

Proof of Theorem 3.3

We consider the existence first. This is an easy consequence of Proposition 3.4 above, which gives that the value function V is a viscosity solution of Eq. (3.15). It then suffices to use the definition of viscosity solution to check that the log-value function J defined in (3.1) is a (possibly discontinuous) viscosity solution of the PDE (3.6).

To complete the proof, it remains to show that J is unique in the class of all continuous viscosity solutions with quadratic growth for the Cauchy problem (3.6) and (3.9). The main idea for uniqueness is to use the comparison theorem in Da Lio and Ley (2006), Th. 2.1. For reader’s convenience, we split the rest of the proof into two steps.

(i) Reduction to Da Lio and Ley (2006) setting. First, we use a Fenchel-Legendre transform to express the quadratic term in our pricing PDE (3.6) into an infimum over the image of B of a suitable function. More precisely, we apply a classical result in convex analysis [e.g. (Rockafellar 1970, Ch.III, Sect. 12)] to get

$$\begin{aligned} F(w):= - \frac{1}{2} \langle w, B w \rangle = \inf _{\alpha \in {\mathrm{Im}}(B)} \{ - {\tilde{F}} (\alpha ) - \langle \alpha ,w \rangle \} = \inf _{\alpha \in {\mathbb {R}}^m} \{ - {\tilde{F}} (\alpha ) - \langle \alpha ,w \rangle \}, \end{aligned}$$

(3.16)

for all vectors $w \in {\mathbb {R}}^m$, where ${\tilde{F}}$ is the conjugate of F, which is also given by

$$\begin{aligned} {\tilde{F}} (\alpha ) = - \frac{1}{2} \langle \alpha , B^{-1} \alpha \rangle , \end{aligned}$$

when $\alpha \in {\mathrm{Im}}(B)$ and $-\infty $ otherwise. Notice that the first infimum in (3.16) is computed over the image of B since the matrix B is not necessarily invertible in our framework. Using (3.16), we can rewrite Eq. (3.6) as

$$\begin{aligned} \begin{array}{l} \displaystyle J_t + \frac{1}{2 \gamma } \langle (\sigma _F^* \sigma _F)^{-1} \mu _F,\mu _F\rangle + \langle {\bar{b}}, J_x \rangle \\ \quad + \sup _{u \in [0,{\bar{u}}]} \Big [ u J_z + qL \Big ] +\gamma F(J_x) + \frac{1}{2} \text {tr} \left( \Sigma ^* \Sigma J_{xx}\right) = 0, \end{array} \end{aligned}$$

(3.17)

with ${\bar{b}}$ as in (3.7) and with terminal condition $J(T,x,z;q) = \frac{\log \gamma }{\gamma } + q\Phi (p(T,x),z)$. Notice that, since the function F above can be written as an infimum as in (3.16), we get a PDE with the same form as in Da Lio and Ley (2006), Eq. (1.1) provided we apply the time reversal transformation ${\widehat{J}} (t,x,z;q):= J(T-t,x,z;q)$. Hence the PDE (3.17) turns into the following

$$\begin{aligned}&\displaystyle -{\widehat{J}}_t + \frac{1}{2 \gamma } \langle (\sigma _F^* \sigma _F)^{-1} \mu _F,\mu _F\rangle + \langle {\bar{b}}, {\widehat{J}}_x \rangle \nonumber \\&\quad + \sup _{u \in [0,{\bar{u}}]} \Big [ u {\widehat{J}}_z + qL \Big ] +\gamma F({\widehat{J}}_x) + \frac{1}{2} \text {tr} \left( \Sigma ^* \Sigma {\widehat{J}}_{xx}\right) = 0, \end{aligned}$$

(3.18)

with the initial condition

$$\begin{aligned} {\widehat{J}}(0,x,z;q) = \frac{\log \gamma }{\gamma } + q\Phi (p(T,x),z). \end{aligned}$$

(3.19)

Notice that this Cauchy problem is a particular case of the one studied in Da Lio and Ley (2006) since our Assumptions 2.1, 2.5 and 3.2 imply Assumptions (A1), (A2), (A3) in Da Lio and Ley (2006). In particular, Assumption 3.2 (iv) implies the same property for $B^{-1}$, giving (A1) (iii) in Da Lio and Ley (2006). Indeed on the image of B, $B^{1/2}$ as well its inverse $B^{-1/2}$ are well-defined. Since $B^{-1/2}: \text {Im}(B) \rightarrow \text {Im}(B)$, we have that, e.g., the LHS in (3.14) implies $\delta ^{-1}\vert B^{-1/2} y\vert ^2 \le \langle B^{-1/2} y, BB^{-1/2}y\rangle $ for all $y\in \text {Im}(B)$, leading to $\langle y , B^{-1} y \rangle \le \delta \vert y \vert ^2$ for all $y\in \text {Im}(B)$. The other inequality is obtained in a similar way.

(ii) Uniqueness. We proceed by contradiction. Assume that there exists another continuous viscosity solution with quadratic growth ${\tilde{J}}$ of the Cauchy problem (3.18) with terminal condition (3.19). Then, by calling $J^*$ and ${\tilde{J}}^*$ their u.s.c. envelopes and $J_*$ and ${\tilde{J}}_*$ their l.s.c. envelopes, we have, by definition of viscosity solution, that $J^*$, ${\tilde{J}}^*$ are u.s.c. viscosity subsolutions and $J_*$, ${\tilde{J}}_*$ are l.s.c. viscosity supersolutions of equation (3.18), obviously with ${\tilde{J}}_* \le {\tilde{J}}^*$. We also have $J_*(T,x,z;q) \le \frac{\log \gamma }{\gamma }+ q \Phi (p(T,x),z) \le J^*(T,x,z;q)$, by definition of upper and lower envelopes. We now want to prove that

$$\begin{aligned} J^*(T,x,z;q) \le \frac{\log \gamma }{\gamma }+q \Phi (p(T,x),z) \le J_* (T,x,z;q) , \end{aligned}$$

(3.20)

for all $q \ge 0$, $x \in {\mathbb {R}}^m, z\in [0,{\bar{u}} T]$. To prove the inequalities (3.20) it suffices to apply Theorem 4.3.2 and subsequent Remark 4.3.5 in Pham (2009).^{Footnote 1}

Moreover it can be proved that J(t, x, z; q) has quadratic growth in (x, z), uniformly in t, for all $q\ge 0$ (ref. Lemma 8.1 in the “Appendix 2”). Then, by the comparison theorem (Da Lio and Ley 2006, Theorem 2.1), we have that

$$\begin{aligned} J_* \le J^* \le {\tilde{J}}_* \le {\tilde{J}}^* \le J_* \end{aligned}$$

on $[0,T] \times \mathbb {R}^m \times \mathbb {R}$. This implies that $J_* = J^* = J = {\tilde{J}}$, and that J is continuous. The proof is therefore complete. $\square $

As a consequence of the result in Theorem 3.3, we have a good candidate for the optimal hedging strategy, which is given by

$$\begin{aligned} {\hat{h}}^q:= {\hat{\pi }}^q - {\hat{\pi }}^0 = - (\sigma _F^* \sigma _F)^{-1} \sigma _F^* \Sigma v_x, \end{aligned}$$

(3.21)

where $v_x$ is the gradient with respect to the factor variables, when it exists, of the UIP (compare Becherer 2003, 2004). Concerning the optimal exercise policy ${\hat{u}}$ of the structured contract, a candidate in feedback form is given by solving the maximization problem

$$\begin{aligned} \max _{u \in [0,{\bar{u}}]} \left[ uv_z (t,x,z;q) + qL (p,z,u)\right] . \end{aligned}$$

For an explicit formula, consider the case $L(p,z,u)=u \ell (p,z)$ with $\ell $ bounded. In this case, it is easy to see that

$$\begin{aligned} {\hat{u}} (t,x,z;q) = {\bar{u}} {\mathbf {1}}_{\left[ v_z (t,x,z;q) > - q \ell (p,z)\right] }. \end{aligned}$$

(3.22)

Even though working with viscosity solutions does not allow to justify rigorously the optimality of such controls, we observe that they are consistent with the optimal policies that have been obtained in the past literature for more specific models (see e.g. Basei et al. 2014; Benth et al. 2012).

Remark 3.5

Note that we have worked on the log-value function’s PDE (3.6) instead of on the PDE for the price v [cf. Eq. (3.11)]. The reason for doing so is that the latter is more delicate to handle due to the fact that it contains the first derivative $J_x^0$ of the log-value function with no claim. Applying Da Lio and Ley’s results directly to Eq. (3.11) would require a Lipschitz continuity for $J_x^0$ uniformly in t, which is difficult to have in general. Nonetheless, when this condition is satisfied as in Cartea–Villaplana (see Sect. 4.2) and in the linear dynamics model in Example 4.6, the same arguments go through with fewer assumptions than in Theorem 3.3. Indeed, the boundedness of the payoffs L and $\Phi $ implies that the UIP v is bounded and so it has quadratic growth. Therefore, Lemma 8.1 is not needed anymore and neither are all the $C^1$-regularities and the boundedness of $\mu _F$ as in Assumption 3.2. Under the remaining assumptions and when $\mu _F$ has linear growth in x uniformly in t (replacing its boundedness) we can prove that v is the unique continuous viscosity solution with quadratic growth to Eq. (3.11) with terminal condition (3.12). The proof is analogous to that of Theorem 3.3, it is therefore omitted.

Remark 3.6

(Complete market case) When the market is complete, i.e. $d = n$ and $\sigma _F$ has full rank, we have $B=0$ so that Assumption 3.2(iv) is trivially satisfied and $J^0_x$ does not appear in the PDE for v anymore. In this case, we can work directly with the PDE for v along the same lines as in the previous Remark 3.5. Therefore, under Assumptions 2.1, 2.5 and 3.2 (i)–(ii)–(iii), one can show that v is the unique continuous viscosity solution with quadratic growth of the HJB equation

$$\begin{aligned} \begin{array}{c} \displaystyle v_t + \langle b-\Sigma ^* (\sigma _F ^* )^{-1} \mu _F , v_x \rangle + \frac{1}{2} \mathrm {tr} \left( \Sigma ^* \Sigma v_{xx}\right) + \sup _{u \in [0,{\bar{u}}]} \Big [ u v_z+ q L \Big ] = 0, \end{array} \end{aligned}$$

(3.23)

with terminal condition

$$\begin{aligned} v(T,x,z;q) = q \Phi (p(T,x),z). \end{aligned}$$

(3.24)

Moreover, one can weaken the boundedness of $\mu _F$ and require only linear growth in x uniformly in t. This result extends to our setting previous ones in Basei et al. (2014), Benth et al. (2012), Chen and Forsyth (2007), Felix (2012), Thompson et al. (2009), which were obtained for particular types of structured contracts, e.g., swings and virtual storages, and without trading in forward contracts.

4 Examples

4.1 A class of models with two assets and constant correlation

In this section we focus on the following incomplete market model:

$$\begin{aligned} \left\{ \begin{array}{rcl} \displaystyle \frac{dF_t}{F_t} &{}=&{} \mu _F (t, X_t) dt + {\bar{\sigma }}_F (t, X_t) dW^1 _t , \\ dX_t &{}=&{} b (t, X_t) dt + \sigma (t, X_t) \left( \rho dW^1 _t + \sqrt{1- \rho ^2}dW^2_t \right) , \end{array} \right. \end{aligned}$$

(4.1)

where $W=(W^1,W^2)$ is a bidimensional Brownian motion and $\rho \in (-1,1)$. This is clearly a particular case of the general model in the previous section with $\sigma ^* _F (t,x) = ({\bar{\sigma }}_F (t,x), 0)$, $\Sigma ^* (t,x) = \sigma (t,x) (\rho , \sqrt{1-\rho ^2})$ and $P_t =p(t,X_t)$ for some continuous function p(t, x). This model is a generalization of the usual Black-Scholes model with basis risk (see Davis 2006; Henderson 2002; Monoyios 2004 among many others), with the additional feature that the non traded asset or factor X can appear in the coefficients of the traded asset F.

We suppose that Assumptions 2.1 and 2.5 are in force. Concerning Assumption 3.2, we are going to specialize it to the present setting as follows. Observe first that the quantity $\Sigma ^* \sigma _F (\sigma _F ^* \sigma _F)^{-1} \mu _F $ appearing in Assumption 3.2(i) reads as

$$\begin{aligned} \Sigma ^* \sigma _F (\sigma _F ^* \sigma _F)^{-1} \mu _F (t,x)= \rho \mu _F(t,x) \frac{ \sigma (t,x)}{{\bar{\sigma }}_F(t,x)} \end{aligned}$$

while the scalar product $\langle (\sigma _F ^* \sigma _F)^{-1} \mu _F , \mu _F\rangle $ in Assumption 3.2(ii) is

$$\begin{aligned} \langle (\sigma _F ^* \sigma _F)^{-1} \mu _F , \mu _F\rangle (t,x) = \frac{\mu _F^2(t,x)}{{\bar{\sigma }}_F^2(t,x)}. \end{aligned}$$

and $(\sigma _F^* \sigma _F)(t,x)$ in Assumption 3.2 (iii) corresponds to $(\sigma _F^* \sigma _F)(t,x)= {\bar{\sigma }}_F^2(t,x)$. Finally, we have $B(t,x)=(1- \rho ^2)\sigma ^2(t,x)$. Hence, Assumption 3.2 is guaranteed by the conditions listed just below and the general results in Theorem 3.3 can be safely applied.

Assumption 4.1

Let the following properties hold:

(i)
$b\in C^1$, $\sigma \in C^1$;
(ii)
$\mu _F$ is bounded;
(iii)
$\sigma $ and ${\bar{\sigma }}_F$ are bounded and bounded away from zero;
(iv)
$\frac{\mu _F}{{\bar{\sigma }}_F} \in C^1$ and it is Lipschitz in x uniformly in t.

In this more specific setting, we can obtain more information on the structure of the value function of the buyer of q units of the structured product provided we have the following

Assumption 4.2

Let the log-value function $J^0 _x$ be Lipschitz in x uniformly in t.

Under this assumption, we do not need to suppose that $\mu _F$ is bounded as in 4.1(ii) above. Indeed the considerations in Remark 3.5 apply, so that in particular $\mu _F$ can be a linear function of x as in Example 4.6 below.

Let $C_T^u$ be the payoff of a given structured contract as in (2.1). Inspired by the results in Oberman and Zariphopoulou (2003), which in turn extend Karoui and Rouge (2000) to American options, we obtain a representation of the UIP of the structured product $C_T^u$ as the value function of an auxiliary optimization problem with respect to the control u only, under a suitable equivalent martingale measure involving the derivative $J_x ^0$ of the log-value function of the problem with no claim, and where $\gamma $ is replaced by a modified risk aversion ${\widetilde{\gamma }}=\gamma (1-\rho ^2)$.

Let us consider the measure ${\mathbb {Q}} ^0$ defined as

$$\begin{aligned} \frac{d{\mathbb {Q}}^0}{d{\mathbb {P}}} \Big \vert _{\mathcal F_t}: = D^0 _t: = \exp \left( -\int _0 ^t \theta ^* _u dW_u -\frac{1}{2} \int _0 ^t |\theta _u | ^2 du \right) , \quad t \in [0,T], \end{aligned}$$

(4.2)

where $W=(W^1 , W^2)^*$ and $\theta $ is given by

$$\begin{aligned} \theta _t = (\theta _t ^1 , \theta _t ^2)^* = \left( \frac{\mu _F}{{\bar{\sigma }}_F} , \; \gamma \sqrt{1-\rho ^2} \sigma J_x ^0 \right) ^* (t,X_t). \end{aligned}$$

(4.3)

Notice that the stochastic exponential is well defined, since X has continuous paths and $\mu _F / {\bar{\sigma }}_F$ is continuous, so that the stochastic integral $\int _0 ^t \theta _u ^1 dW_u ^1$ is well-defined for every t. Moreover, the second integral $\int _0 ^t \theta ^2 _u dW_u ^2$ is also well-defined thanks to the continuity of $\sigma (t,X_t)$ and the linear growth of $J_x ^0$ (cf. Lemma 8.1).

Finally, in order for the equation (4.2) to define a probability measure, we need to impose that ${\mathbb {E}}[D_T ^0] =1$. This equality holds true when, for instance, $J_x^0$ is bounded, so that in particular Novikov’s criterion applies. More generally, one could use the deterministic criteria proposed in Mijatović and Urusov (2012) (e.g. Theorem 2.1 therein).

Remark 4.3

In the case when the coefficients of F do not depend on the state variable X, when, e.g. both follows geometric Brownian motions with constant correlation, we have that $J^0 _x \equiv 0$, and ${\mathbb {Q}}^0$ coincides with the minimal entropy martingale measure. Therefore the measure ${\mathbb {Q}}^0$ can be viewed as a perturbation of the minimal entropy martingale measure (see Frittelli 2000) where the correction involves the log-value function $J^0$ of the optimal pure investment problem.

In what follows we will need the following preliminary lemma, stating the dynamics of the spot price under the martingale measure ${\mathbb {Q}}^0$. Its proof is based on a standard application of Girsanov’s theorem, and it is therefore omitted.

Lemma 4.4

Assume ${\mathbb {E}}[D_T ^0] =1$. Then the dynamics of X under ${\mathbb {Q}}^0$ is given by

$$\begin{aligned} d X_t = {\tilde{b}} (t,X_t) dt + \sigma (t,X_t) d W^0 _t, \end{aligned}$$

(4.4)

where

$$\begin{aligned} {\tilde{b}} (t,X_t):= \left( b - \rho \sigma \frac{\mu _F}{{\bar{\sigma }}_F} - {{\tilde{\gamma }}} \sigma ^2 J_x ^0 \right) (t, X_t) \end{aligned}$$

and

$$\begin{aligned} d W^0_t:= \rho dW^1 _t + \sqrt{1- \rho ^2}dW^2_t + \left( \rho \frac{\mu _F}{{\bar{\sigma }}_F} + \tilde{\gamma }\sigma J_x ^0 \right) (t,X_t) dt \end{aligned}$$

defines a ${\mathbb {Q}}^0$-Brownian motion and ${{\tilde{\gamma }}} = \gamma (1 - \rho ^2)$.

The following proposition extends to our setting the characterisation in Oberman and Zariphopoulou (Oberman and Zariphopoulou 2003 Prop. 10).

Proposition 4.5

Let the standing Assumptions 2.1, 2.5, 4.1 (i)–(iii)–(iv) and 4.2 hold. Then the UIP $v = v(t,x,z;q)$ satisfies

$$\begin{aligned} v(t,x,z;q) = \sup _{u \in \mathcal U_t} \left( - \frac{1}{\tilde{\gamma }} \ln {\mathbb {E}}^0 _{t,x,z} \left[ e^{- {{\tilde{\gamma }}} q C_{t,T}^u} \right] \right) , \end{aligned}$$

(4.5)

where ${\mathbb {E}}^0 _{t,x,z}$ denotes the conditional expectation under ${\mathbb {Q}}^0$.

Proof

We prove the result by showing that the candidate function

$$\begin{aligned} {\tilde{v}} = {\tilde{v}}(t,x,z;q):= \sup _{u \in \mathcal U_t} \left( - \frac{1}{{{\tilde{\gamma }}}} \ln {\mathbb {E}}^0 _{t,x,z} \left[ e^{- {{\tilde{\gamma }}} q C_{t,T}^u} \right] \right) \end{aligned}$$

satisfies Eq. (3.11) with terminal condition (3.12) and we conclude using the comparison theorem in Da Lio and Ley (2006), Th. 2.1. To this end, write ${\tilde{v}}$ as

$$\begin{aligned} {\tilde{v}}(t,x,z;q)= & {} - \frac{1}{{{\tilde{\gamma }}}} \ln ( - w(t,x,z;q)), \end{aligned}$$

(4.6)

with

$$\begin{aligned} w(t,x,z;q): = \sup _{u \in \mathcal U_t} {\mathbb {E}}^0_{t,x,z} \left[ - e^{- {{\tilde{\gamma }}} q C_{t,T}^u }\right] . \end{aligned}$$

The value function w above solves the following Cauchy problem in a viscosity sense

$$\begin{aligned} \left\{ \begin{array}{l} \displaystyle w_t(t,x,z;q) + \sup _{u \in [0,{\bar{u}}]} \left[ \mathcal L^{u} w(t,x,z;q) - {{\tilde{\gamma }}} q L(p(t,x),z,u) w(t,x,z;q) \right] = 0 \\ w(T,x,z;q) = -\exp (-{{\tilde{\gamma }}} q \Phi (p(T,x),z) ) \end{array} \right. \end{aligned}$$

with

$$\begin{aligned} \mathcal L^{u} w = {\tilde{b}} w_x + u w_z + \frac{1}{2} \sigma ^2 w_{xx}. \end{aligned}$$

The corresponding Cauchy problem for ${\tilde{v}}$ is immediately obtained:

$$\begin{aligned} \left\{ \begin{array}{l} \displaystyle {\tilde{v}}_t(t,x,z;q) + \sup _{u \in [0,{\bar{u}}]} \left[ \tilde{\mathcal L}^{u} {\tilde{v}}(t,x,z;q) + q L(p(t,x),z,u) \right] = 0 \\ {\tilde{v}}(T,x,z;q) = q \Phi (p(T,x),z), \end{array} \right. \end{aligned}$$

(4.7)

with

$$\begin{aligned} \tilde{\mathcal L}^{u} {\tilde{v}}= & {} {\tilde{b}} {\tilde{v}}_x + u {\tilde{v}}_z + \frac{1}{2} \sigma ^2 \left[ {\tilde{v}}_{xx} - {{\tilde{\gamma }}} {\tilde{v}}_x^2 \right] , \end{aligned}$$

which is a particular case of Eq. (3.11) in this setting.

To identify ${\tilde{v}}$ with the UIP v and conclude, we need a uniqueness result for the Cauchy problem (4.7). Since $J_x ^0$ is assumed to be Lipschitz in x uniformly in t, we can use Remark 3.5 to get the existence of a unique continuous viscosity solution with quadratic growth to the Cauchy problem (4.7). Finally, the boundedness of the payoff $C^u _{t,T}$ (cf. Assumption 2.1) clearly implies that the value function ${\tilde{v}} (t,x,z)$ has quadratic growth. Thus the proof is complete. $\square $

The previous proposition suggests the following approach to compute the UIP and the corresponding (partial) hedging strategy of a given structured product in this setting:

first, solve the pure optimal investment problem V(t, x, y; 0) with no claim;
second, compute the x-derivative of the log-value function $J^0$ giving the new probability measure ${\mathbb {Q}}^0$ as well as the corresponding dynamics of X;
finally, solve the maximisation problem in (4.5), which is now computed with respect to the control u only; its value function gives the UIP while its derivative with respect to x gives the hedging strategy via (3.5).

Example 4.6

(Linear dynamics model) This example is a slight generalization of the model studied in Carmona and Ludkovski (2006), Sect. 2.2 and Fiorenzani (2006):

$$\begin{aligned} dF_t= & {} F_t\left( (a - k X_t) dt + {\bar{\sigma }}_F dW_t ^1\right) ,\end{aligned}$$

(4.8)

$$\begin{aligned} dX_t= & {} \delta (\theta - X_t )dt + \sigma \left( \rho dW^1 _t + \sqrt{1-\rho ^2} dW_t ^2\right) , \end{aligned}$$

(4.9)

where $a,k,{\bar{\sigma }}_F , \delta , \theta , \sigma $ are real constants, the correlation $\rho $ belongs to $(-1,1)$, and $(W^1, W^2)$ is a bidimensional Brownian motion as before. Here F represents the price of a liquid forward contract with maturity T written on a commodity, while instead X is the log-price of another, less liquid, commodity on which the structured product is written (i.e. $p(t,x)=e^x$ in this case). In practical applications, one searches for a liquidly traded forward F written on a commodity correlated with $P_t = e^{X_t}$, with a correlation coefficient $\rho $ as close to 1 as possible [for practical examples, see Carmona and Ludkovski (2006), Sect. 2.3 and Fiorenzani (2006)]. When $k = 1$ we obtain exactly the model in Carmona and Ludkovski (2006), Sect. 2.2, while for $k = 0$ we obtain the model in Fiorenzani (2006).

Notice that if ${\bar{\sigma }}_F>0$, $\sigma >0$ and $k=0$, then Assumption 4.1 holds true, while in the general case when $k \ne 0$ Assumptions 4.1 (ii) is not satisfied. Nevertheless, as we are going to see, in this example $J_x^0$ is Lipschitz, so that Remark 3.5 applies. Hence we can take $\mu _F$ linear in x as above.

To see that $J_x ^0$ is Lipschitz, consider Eq. (3.10) which in this setting becomes

$$\begin{aligned}&J_t^0 + \frac{1}{2 \gamma } \frac{(a - k x)^2}{{\bar{\sigma }}_F ^2} - \frac{\rho \sigma }{{\bar{\sigma }}_F} (a - k x) J_x^0 + \delta (\theta - x) J^0_x\\&\quad - \frac{1}{2} \gamma \sigma ^2 (1 - \rho ^2) \left( J_x^0 \right) ^2 + \frac{1}{2} \sigma ^2 J_{xx}^0 = 0. \end{aligned}$$

Then, in analogy with Benth and Karlsen (2005), one guesses that the solution $J^0$ has the general form

$$\begin{aligned} J^0(t,x) = \alpha (t) + \beta (t) x + \Gamma (t) x^2, \end{aligned}$$

such that $J^0(T,x) \equiv \frac{\log \gamma }{\gamma }$. This ansatz gives the system of first order ODEs

$$\begin{aligned} \left\{ \begin{array}{l} \displaystyle \alpha ' + \frac{a^2}{2 \gamma {\bar{\sigma }}_F ^2} + \left( \delta \theta - \rho \frac{\sigma }{{\bar{\sigma }}_F} a\right) \beta - \frac{1}{2} \gamma \sigma ^2 (1 - \rho ^2) \beta ^2 + \sigma ^2 \Gamma = 0,\\ \displaystyle \beta ' + \left( \rho k\frac{\sigma }{{\bar{\sigma }}_F} -\delta -2\gamma \sigma ^2 (1-\rho ^2)\Gamma \right) \beta - \frac{ak}{\gamma {\bar{\sigma }}_F ^2} + 2 \left( \delta \theta - \rho a \frac{\sigma }{{\bar{\sigma }}_F} \right) \Gamma = 0, \\ \displaystyle \Gamma ' + \frac{k^2}{2 \gamma {\bar{\sigma }}_F ^2} + 2 \left( \rho k \frac{\sigma }{{\bar{\sigma }}_F} -\delta \right) \Gamma - 2 \gamma \sigma ^2 (1 - \rho ^2) \Gamma ^2 = 0, \\ \end{array} \right. \end{aligned}$$

(4.10)

with final condition

$$\begin{aligned} \alpha (T) = \frac{\log \gamma }{\gamma }, \qquad \beta (T) = 0, \qquad \Gamma (T) = 0. \end{aligned}$$

The system above is solvable in closed form, as the third equation is a Riccati equation in $\Gamma $, the second one is a linear equation in $\beta $, which can be solved once that $\Gamma $ is known, and, finally, the first one can be solved in $\alpha $ just by integration.

Notice that, if the parameter k appearing in the forward drift is zero then the dynamics of the forward contract does not depend on X, so that $J^0$ does not depend on x, thus leading to $\beta \equiv \Gamma \equiv 0$.

Finally, Eq. (3.11) is given in this case by

$$\begin{aligned} \begin{array}{c} \displaystyle v_t + \left( \delta (\theta - x) - \rho \frac{\sigma }{{\bar{\sigma }}_F} (a - k x) - \gamma \sigma ^2 (1 - \rho ^2) (\beta + 2 \Gamma x) \right) v_x + \frac{1}{2} \sigma ^2 v_{xx} \\ \displaystyle - \frac{1}{2} \gamma \sigma ^2 (1 - \rho ^2) v_x^2 + \sup _{u \in [0,{\bar{u}}]} \Big [ u v_z+ q L \Big ] = 0, \end{array} \end{aligned}$$

(4.11)

with terminal condition

$$\begin{aligned} v(T,x,z;q) = q \ \Phi (e^x,z). \end{aligned}$$

(4.12)

4.2 The Cartea–Villaplana model with correlation

Here we consider a slight generalization of the two factor model for the electricity spot price introduced by Cartea and Villaplana (2008). While the two factors are assumed independent in the original paper (Cartea and Villaplana 2008), here we allow for a possibly non zero (constant) correlation between them. We recall briefly the main features of the model. The electricity spot log-price $P_t$ at time t is decomposed into the sum of two stochastic factors $X^C$ and $X^D$, i.e.,

$$\begin{aligned} P_t = \exp \left( \eta (t) + \alpha _C X^C_t + \alpha _D X^D_t \right) , \end{aligned}$$

with $\alpha _C < 0$ and $\alpha _D > 0$, where $\eta $ represents a seasonal continuous deterministic component. The factors $X^i_t$, $i = C,D$, are Ornstein-Uhlenbeck processes driving, respectively, the capacity of power plants and the demand of electricity. Their dynamics is given by

$$\begin{aligned} d X^i_t = - k^i X^i_t\ dt + \sigma _i(t)\ dW^i_t , \end{aligned}$$

where $k^i$ are constant coefficients, $\sigma _i (t)$ are deterministic measurable functions of time and each $W^i$, for $i=C,D$, is a unidimensional Brownian motion such that $ d\langle W^C, W^D \rangle _t = \rho dt $ with a constant correlation $\rho \in (-1,1)$. Notice that the Cartea–Villaplana model reduces to the Schwarz–Smith model (2000) when $\alpha _C = \alpha _D = 1$ and $k^C = 0$ (or $k^D = 0$). In this example we work under the following standing assumptions:

Assumption 4.7

Let $\sigma _C(t)$ and $\sigma _D (t)$ be continuous, bounded and bounded away from zero over [0, T].

Since the interest rate is zero, the price at time t of a forward contract with maturity T can be computed via the usual formula $F_t = \mathbb {E}^\mathbb {Q}[P_T|\mathcal{F}_t]$, $t\in [0,T]$, for a suitable choice of risk-neutral measure ${\mathbb {Q}}$ preserving the Gaussian structure of the model as in Cartea and Villaplana (2008), Sect. 5. Following the approach in Cartea and Villaplana (2008) we can obtain the dynamics of the forward price under the risk-neutral measure ${\mathbb {Q}}$ as

$$\begin{aligned} \frac{dF_t}{F_t} = \alpha _C e^{-k^C(T-t)} \sigma _C(t)\ dW_t^{{\mathbb {Q}},C} + \alpha _D e^{-k^D(T-t)} \sigma _D(t)\ dW_t^{{\mathbb {Q}}, D}, \end{aligned}$$

where $W^{{\mathbb {Q}}, C}$ and $W^{{\mathbb {Q}}, D}$ are two $\mathbb Q$-Brownian motions with correlation $\rho $. Choosing suitably the market prices of risk as in Cartea and Villaplana (2008) and using Assumption 4.7, we can obtain the following forward dynamics under the objective probability ${\mathbb {P}}$:

$$\begin{aligned} \frac{dF_t}{F_t} = \mu _F (t) dt + \alpha _C e^{-k^C(T-t)} \sigma _C(t)\ dW_t^C + \alpha _D e^{-k^D(T-t)} \sigma _D(t)\ dW_t^D, \end{aligned}$$

where the drift $\mu _F (t)$ is a bounded function of time.

We deal separately with two different situations: the incomplete market case with one forward contract (recall that we have two stochastic factors) and the complete one with two forward contracts.

4.2.1 The case of one forward contract

In this case the agent is allowed to hedge the structured product by trading only in one forward contract. The Cartea–Villaplana model fits the general setting of Sect. 2.3 with $X = (X^C,X^D)^*$, whose coefficients are

$$\begin{aligned} b(t,x^C,x^D) = \left( \begin{array}{c} - k^C x^C \\ - k^D x^D \\ \end{array} \right) , \quad \Sigma ^*(t,x^C,x^D) = \left( \begin{array}{c@{\quad }c} \sigma _C(t) &{} 0 \\ 0 &{} \sigma _D(t) \\ \end{array} \right) \cdot \left( \begin{array}{c@{\quad }c} 1 &{} 0 \\ \rho &{} \sqrt{1 - \rho ^2} \\ \end{array} \right) . \end{aligned}$$

Notice that $\Sigma $ has full rank unless $\rho = \pm 1$, as

$$\begin{aligned} \Sigma ^* \Sigma = \left( \begin{array}{cc} \sigma _C^2 &{} \rho \sigma _C \sigma _D\\ \rho \sigma _C \sigma _D &{} \sigma _D^2 \\ \end{array} \right) . \end{aligned}$$

Let us consider a forward contract F with maturity T. Here $\sigma _F(t,X_t)$ depends only on t, so that for simplicity we set $\sigma _F (t):= \sigma _F(t,X_t)$, and we have

$$\begin{aligned} \sigma _F^*(t)= & {} \left( \begin{array}{cc} \alpha _C e^{- k^C(T - t)} \sigma _C(t) \quad \alpha _D e^{- k^D(T-t)} \sigma _D(t) \\ \end{array} \right) \cdot \left( \begin{array}{cc} 1 &{} 0 \\ \rho &{} \sqrt{1 - \rho ^2} \\ \end{array} \right) \\= & {} \left( \alpha _C e^{- k^C(T - t)} \sigma _C(t) + \rho \alpha _D e^{- k^D(T-t)} \sigma _D(t) , \sqrt{1-\rho ^2} \alpha _D e^{- k^D(T-t)} \sigma _D(t) \right) . \end{aligned}$$

We note that, since the correlation between the spot and forward log-prices is not constant, this model does not fit the setting in Sect. 4.1.

In this model the matrix B has rank equal to one. In fact, by definition [cf. Eq. (3.8)] we have

$$\begin{aligned} B = \Sigma ^* (I_2 - \sigma _F (\sigma _F^* \sigma _F)^{-1} \sigma _F^*) \Sigma , \end{aligned}$$

with

$$\begin{aligned} (\sigma _F^* \sigma _F)(t)= & {} \alpha _D^2 \sigma ^2_D(t) e^{- 2 k^D (T-t)} + \alpha _C^2 \sigma _C^2(t) e^{-2k^C(T-t)} \nonumber \\&\quad +\, 2 \rho \alpha _C \alpha _D \sigma _C(t) \sigma _D(t) e^{- (k^C + k^D) (T-t)} . \end{aligned}$$

(4.13)

Consider $x = \Sigma ^{-1} \sigma _F$. Then $x \ne 0$ and we have

$$\begin{aligned} \langle x, B x\rangle = \sigma _F^* (I_2 - \sigma _F (\sigma _F^* \sigma _F)^{-1} \sigma _F^*) \sigma _F = \sigma _F^* \sigma _F - \sigma _F^* \sigma _F (\sigma _F^* \sigma _F)^{-1} \sigma _F^* \sigma _F = 0. \end{aligned}$$

Therefore, working on the image of B in Eq. (3.16) is fully justified here, as $\mathrm {rank}(B) = 1$. Now, we show that Assumption 3.2(iv) is satisfied in this case. Indeed, a direct computation shows that

$$\begin{aligned} B&= \kappa (t) \times \left( \begin{array}{lc} \alpha _D^2 e^{-2k^{D}(T-t)} &{}\quad -\alpha _C\alpha _D e^{-(k^C+k^{D})(T-t)}\\ -\alpha _C\alpha _D e^{-(k^C+k^{D})(T-t)} &{}\quad \alpha _C ^2 e^{-2k^{C}(T-t)} \end{array}\right) \end{aligned}$$

where

$$\begin{aligned} \kappa (t):= \dfrac{(1-\rho ^2)\sigma _C ^2 (t) \sigma _D ^2 (t)}{\alpha _{D}^{2} \sigma _{D}^{2}(t) e^{-2k^{D}(T-t)}+\alpha ^{2}_{C} \sigma _{C}^{2}(t) e^{-2k^{C}(T-t)}+2 \rho \alpha _C\alpha _D \sigma _C (t) \sigma _D (t) e^{-(k^{C}+k^{D})(T-t)}}. \end{aligned}$$

Hence, the two eigenvalues of B are $\lambda _1 (t) \equiv 0$ and

$$\begin{aligned} \lambda _2 (t) = \kappa (t) \left( \alpha _D ^2 e^{-k^D (T-t)} + \alpha _C ^2 e^{-k^C (T-t)} \right) >0 . \end{aligned}$$

By Assumption 4.7 we have that $\sigma _C(t)$ and $\sigma _D (t)$ are bounded and bounded away from zero over [0, T], yielding $\frac{1}{\delta } \le \lambda _2 (t) \le \delta $ for some $\delta >0$ independent of $t \in [0,T]$. This implies Assumption 3.2 (iv).

Since in this example the two factors $X^C$ and $X^D$ do not enter in the coefficients of the forward contract dynamics, we expect that the derivative $J^0 _x $ of the log-value function is zero. Indeed, this can be obtained from the PDE (3.10) satisfied by $J^0$. Since $\mu _F$ and $\sigma _F$ do not depend on X, such a PDE simplifies to

$$\begin{aligned} \displaystyle \displaystyle J^0 _t + \frac{1}{2 \gamma } \frac{|\mu _F|^2}{|\sigma _F|^2} = 0, \end{aligned}$$

which gives

$$\begin{aligned} J^0(t) = \frac{\log \gamma }{\gamma } + \int _t^T \frac{1}{2 \gamma } \frac{|\mu _F(s)|^2}{|\sigma _F(s)|^2}\ ds. \end{aligned}$$

Therefore $J_x^0 \equiv 0$, and Eq. (3.11) for the UIP becomes

$$\begin{aligned}&\displaystyle v_t + \langle b - \Sigma ^* \sigma _F (\sigma _F^* \sigma _F)^{-1} \mu _F , v_x \rangle \\&\quad + \frac{1}{2} \mathrm {tr} \left( \Sigma ^* \Sigma v_{xx}\right) - \frac{1}{2} \gamma v_x^{*} B v_x + \sup _{u \in [0,{\bar{u}}]} \Big [ u v_z+ q L \Big ] = 0. \end{aligned}$$

Hence, under Assumption 4.7, the considerations in Remark 3.5 apply and give that the UIP v is the unique viscosity solution with quadratic growth of the PDE above.

Finally, in this case the candidate optimal hedging strategy is given by ${\hat{h}}^q = {\hat{\pi }}^q - {\hat{\pi }}^0 = - (\sigma _F ^* \sigma _F)^{-1} \sigma _F ^* \Sigma v_x$ as in (3.21), where $\sigma _F ^* \sigma _F$ is as in (4.13) and

$$\begin{aligned} (\sigma _F^* \Sigma )^*(t) = \left( \begin{array}{c} \alpha _C e^{-(T-t) k^C} \sigma ^2_C(t) + \rho \alpha _D e^{-(T-t) k^D} \sigma _C(t) \sigma _D(t) \\ \alpha _D e^{-(T-t) k^D} \sigma ^2_D(t) + \rho \alpha _C e^{-(T-t) k^C} \sigma _C(t) \sigma _D(t) \end{array} \right) . \end{aligned}$$

4.2.2 The case of two forward contracts

We look now at the much simpler situation where the agent can hedge the structured product by trading in two forward contracts $F^1$ and $F^2$ with respective maturities $T_1$ and $T_2$, with $T \le T_1 < T_2$. Then we have

$$\begin{aligned} \sigma _F^*(t) = \left( \begin{array}{cc} \alpha _C e^{- k^C(T_1 - t)} \sigma _C(t) &{} \alpha _D e^{- k^D(T_1-t)} \sigma _D(t) \\ \alpha _C e^{- k^C(T_2 - t)} \sigma _C(t) &{} \alpha _D e^{- k^D(T_2-t)} \sigma _D(t) \\ \end{array} \right) \cdot \left( \begin{array}{cc} 1 &{} 0 \\ \rho &{} \sqrt{1 - \rho ^2} \\ \end{array} \right) . \end{aligned}$$

Of course, in this case $B = 0$, since $\sigma _F$ is invertible. Hence, the market model is complete and we are in the situation described in Remark 3.6. Analogously to the previous case, it is possible to find an explicit expression for $J^0$, which is now given by

$$\begin{aligned} J^0(t) = \frac{\log \gamma }{\gamma } + \int _t^T \frac{1}{2 \gamma } \langle \mu _F, (\sigma ^*_F \sigma _F)^{-1} \mu _F \rangle (s) ds. \end{aligned}$$

Here again $J_x^0 \equiv 0$, so that Remark 3.5 applies and Eq. (3.11) for the UIP becomes

$$\begin{aligned} \displaystyle v_t + \langle {\bar{b}} , v_x \rangle + \frac{1}{2} \mathrm {tr} \left( \Sigma ^* \Sigma v_{xx}\right) + \sup _{u \in [0,{\bar{u}}]} \Big [ u v_z+ q L \Big ] = 0. \end{aligned}$$

Finally the candidate optimal hedging strategy is given by ${\hat{h}}^q = -(\sigma _F^* \sigma _F)^{-1} \sigma _F^* \Sigma v_x$ as before, where this time

$$\begin{aligned} (\sigma _F^* \sigma _F)^{-1} (\sigma _F^* \Sigma )(t) = \left( \begin{array}{cc} \displaystyle \frac{e^{-(T_1-t) k^C}}{\alpha _C \left( 1-e^{(T_1-T_2)(k^C - k^D)}\right) } &{} \displaystyle \frac{e^{-(T_1-t) k^D}}{\alpha _D \left( 1-e^{(T_1-T_2)(k^D - k^C)}\right) } \\ \\ \displaystyle \frac{e^{-(T_2-t) k^C}}{\alpha _C \left( 1-e^{(T_1-T_2)(k^D - k^C)}\right) } &{} \displaystyle \frac{e^{-(T_2-t) k^D}}{\alpha _D \left( 1-e^{(T_1-T_2)(k^C - k^D)}\right) } \end{array} \right) . \end{aligned}$$

5 Numerical results

In this section we present some numerical applications of our results to swing options (see Example 2.2).^{Footnote 2} We focus on this type of contract for essentially two reasons: first, swing options are the main type of volumetric contracts in commodity markets and, second, we want to compare our results to those in Benth et al. (2012).

More specifically, in Sect. 5.1 we consider the benchmark case with strike price $K=0$ and minimal cumulated quantity $m=0$ in order to compare the prices obtained following the UIP approach to those in Benth et al. (2012); in Sect. 5.2 we consider more general swing options with $K > 0$ and $m >0$. In both parts, we compute the solution of the relevant PDEs using finite difference schemes, as suggested in Benth et al. (2012).

5.1 Comparison with the results in Benth et al. (2012)

Here, we compare the UIP, obtained by solving the non-linear PDE (4.11), with the classical linear pricing rule which is used in the energy market literature (e.g. Basei et al. 2014; Benth et al. 2012; Chen and Forsyth 2007; Felix 2012; Thompson et al. 2009). The latter is given in terms of a PDE which is essentially linear, except for the first derivative in z and which has the same form as Eq. (3.23), namely Eq. (4.11) without the quadratic term in $v_x$. In both cases, the optimal strategy ${\hat{u}}(t,x,z;q)$ is given by Eq. (3.22) with $\ell (p,z) = p-K$.

We consider, as in Benth et al. (2012), one swing option (i.e., we take $q=1$) with parameter values

$$\begin{aligned} K = 0, \quad {\bar{u}} =1 , \quad T=1, \quad m=0, \quad M =0.5 , \end{aligned}$$

i.e., the control u belongs to [0, 1] and the holder faces the problem of picking the most favorable price of the commodity, up to a certain total volume M. We set the risk-free interest rate to zero. Moreover, in order be as close as possible to the setting considered in Benth et al. (2012), where $Z^u$ is constrained to fulfil $Z^u _T \le M = 0.5$, we use the penalty function

$$\begin{aligned} \Phi (p,z) = \min (0,- C(z - 0.5)) \end{aligned}$$

(5.1)

with $C=1000$. Indeed, the authors in Basei et al. (2014) prove that when $C \rightarrow \infty $ the price of a contract with penalty $\Phi $ as in (5.1) converges to the price of a contract with the constraint on $Z^u$ as above.

Moreover, with a view towards the comparison with Benth et al. (2012), we choose a special case of the linear dynamics model of Example 4.6 with $k = 0.01$ and where

$$\begin{aligned} \delta = 0.4, \quad \sigma = 0.55, \quad \theta = 3.5, \quad \sigma _F = 0.3, \quad a = 0.03, \quad \rho = 0.5. \end{aligned}$$

(5.2)

Finally, the risk-aversion parameter is set to be $\gamma = 0.02$.

Remark 5.1

Notice that the coefficients $\delta , \theta $ and $\sigma $ above correspond, respectively, to $\kappa , \mu $ and $\sigma $ in Benth et al. (2012), and they have the same numerical values as in Benth et al. (2012). The remaining coefficients $\sigma _F$ and a refer to the dynamics of the forward contract F, which is not part of the model in Benth et al. (2012), and $\rho $ is the correlation between (the logarithms of) the spot price P and F.

We compute both kinds of price (the risk-neutral price and the UIP) for such a contract, solving numerically the corresponding PDE via finite difference methodology with a backward time stepping scheme. In all the numerical experiments we use an approximating domain for the logarithm of the spot price which is wider than the one in Benth et al. (2012) (where $x_{min} = \ln (21.6)$ and $x_{max} = \ln (73.9)$) and the domain for Z is obviously $[0,{\bar{u}} T] = [0,1]$, thus leading to a global domain

$$\begin{aligned} \mathcal D:= [0,T] \times [x_{min}, x_{max}] \times [0,1] \end{aligned}$$

with $x_{min} = \ln (0.001)$ and $x_{max} = \ln (500)$. Notice that $[x_{min}, x_{max}] $ here is wider with respect to the interval used in Benth et al. (2012), so that the probability that X belongs to this interval is higher, thus leading to more stable numerical results. The boundary conditions are the same as in Benth et al. (2012) as well as the numerical approximations of $v_t$ and $v_z$: denoting by $v_{i,j}^n$ the approximation of $v(t_n, x_i,z_j;1)$ with $n \in \{ 0, \dots ,N \}, i \in \{ 0, \dots , I \}$ and $j \in \{ 0, \dots , J \}$ we have

$$\begin{aligned} v_t (t_n, x_i,z_j) \mathop {=}\limits ^{\sim } \frac{v_{i,j}^{n+1} - v_{i,j}^{n}}{ \Delta t}, \qquad v_{z} (t_n, x_i,z_j) \mathop {=}\limits ^{\sim } \frac{v_{i,j+1}^{n+1} - v_{i,j}^{n+1}}{\Delta z}, \end{aligned}$$

with $\Delta t:= \frac{T}{N}, \Delta z:= \frac{1}{J}$, while we use a fully explicit scheme also for the derivatives in x

$$\begin{aligned} v_x (t_n, x_i,z_j) \mathop {=}\limits ^{\sim } \frac{v_{i+1,j}^{n+1} - v_{i-1,j}^{n+1}}{ 2 \Delta x}, \qquad v_{xx} (t_n, x_i,z_j) \mathop {=}\limits ^{\sim } \frac{v_{i+1,j}^{n+1} -2 v_{i,j}^{n+1} + v_{i-1,j}^{n+1}}{(\Delta x)^2} \end{aligned}$$

with $\Delta x:= \frac{ x_{max} - x_{min}}{I}$. We set

$$\begin{aligned} N=3500 , \quad I= 100, \quad J=225 \end{aligned}$$

in order to have convergence of our numerical solution to the UIP. The proof of the convergence can be found in “Appendix 3”.

5.1.1 Numerical results

We plot in Fig. 1 the prices of the swing contract at time $t=0.5$, obtained with the two approaches (a similar picture can be provided at any other date). In order to stress the difference between the two prices, we do not plot the surfaces for $z \in [0.25 , 0.5]$ (remember that $M=0.5$). As we can see, the two price surfaces have similar shapes, even though the “classical” procedure slightly overprices the option with respect to the UIP when the log spot price is high. The difference between the two prices is clearly due to the risk aversion $\gamma $ and, secondarily, to the correlation $\rho $ between the underlying and the forward market where the buyer can invest.

We conclude this part by illustrating in Tables 1 and 2 below the effect that those two parameters separately have on the UIP. Concerning the dependence of the UIP on $\gamma $, which are summarized in Table 1, we choose x, z and t so that the difference between the UIP for $\gamma $ that varies and the UIP for $\gamma = 0.01$ is as large as possible (on the domain of Fig. 1 with $x_{min} = \ln (21.6)$ and $x_{max} = \ln (73.9)$). Similarly, Table 2 shows how the UIP varies with $\rho $. As we can see, the UIP is decreasing in $\gamma $, while it is neither increasing, nor decreasing in $\rho $. The first effect is very natural, since a higher risk aversion for the buyer is expected to induce a lower price. Concerning $\rho $, one would expect a higher price as the correlation $\rho $ with the forward market increases (in absolute value), since this widens the hedging opportunities for the buyer. Nevertheless, in this model also $J_0$ (i.e., the log-value function without investing in the structured product) depends on $\rho $ and this seems to produce a more complicated, non necessarily monotonic, dependence on $\rho $. The combined effect of $\gamma $ and $\rho $ is not clear in general.

Table 1 Different values of UIP for a varying $\gamma $ and $x= \log 73.9 \simeq 4.30$, $z= 0$, $t=0.5$, for $\rho $ fixed to 0.5

Full size table

Table 2 Different values of UIP for a varying $\rho $ and $x= \log 73.9 \simeq 4.30$, $z= 0$, $t=0.5$, for $\gamma $ fixed to 0.01

Full size table

5.2 A more realistic example

We now focus on computing the UIP of a more realistic swing option contract, with $q=1$,

$$\begin{aligned} K = \exp (2.5), \quad {\bar{u}}=1, \quad T=1, \quad m=0.1, \quad M=0.5. \end{aligned}$$

Indeed, swing contracts usually have strictly positive strike price and a nonzero minimal cumulated quantity to be purchased. The penalty function we use is the one in Equation (2.2) with $C = 1000$. We keep working under the linear dynamics model in Example 4.6, with $k=0.01$ and with parameters as in (5.2). We solve the PDE for v using a backward time stepping finite difference method on the domain $ {\mathcal D} = [0,T] \times [x_{min}, x_{max}] \times [0,1]$, where $x_{min}= \ln (0.001), x_{max}= \ln (500)$.

The approximating schemes for $v_t, v_z,v_x$ and $v_{xx}$ are as in Sect. 5.1, as well as the boundary conditions, except for $x=x_{min}$: in fact, if $x=x_{min}$ the optimal operational behavior still consists in waiting as long as possible before exercising (this is because $x_{min}$ is much smaller than the expectation of X in the long run and the price is thus expected to increase), but now we have to take into account the constraint $m =0.1$ [recall that $m=0$ in Benth et al. (2012)]. Hence we set:

$$\begin{aligned} u_s = \left\{ \begin{array}{lcl} 0, &{}&{} s \in \left( t, T - \frac{(m-z)^+}{{\bar{u}}} \right] \\ {\bar{u}}, &{}&{} s \in \left( T - \frac{(m-z)^+}{{\bar{u}}}, T \right) . \end{array} \right. \end{aligned}$$

With this choice of u, it is possible to explicitly compute the approximating price (recall that in the linear dynamics model in Example 4.6 the spot price is $P_t = e^{X_t}$ and that $\bar{u} =1$)

$$\begin{aligned}&{\mathbb {E}}_{t,x_{min},z} \left[ \int _t^T u_s (e^{X_s} - K) ds + \Phi (e^{X_T},Z_T^u) \right] \\&\quad = {\mathbb {E}}_{t,x_{min},z}\left[ \int _{T - (m-z)^+}^T (e^{X_s} - K) ds + \Phi (e^{X_T},Z_T^u) \right] \end{aligned}$$

as done in Benth et al. (2012), Appendix A.

In Fig. 2 we plot the price of the swing option at two different dates.

Notice that in both Fig. 2a, b we cut the domain in z in order to focus on positive prices: for $0.5 =M< z < 1$ the penalty function plays a crucial role and the price becomes negative. We see that the UIP is decreasing in z [as in Benth et al. (2012)] and increasing in x. Moreover, from Fig. 2b it is clear that for $z > 0.25$ the price is strictly decreasing. This might be explained as follows: for a fixed value of the log spot x and for $t=0.75$, if $z>0.25$ the value of the contract is lower than when $z \le 0.25$ and it even becomes lower and lower as z increases, since the time to maturity is equal to 0.25 and so if $z>0.25$ the buyer has less opportunities to exercise the option, hence less possibilities to take advantage of (possibly) higher prices. This is analogous to what happens with linear prices, see e.g. Basei et al. (2014).

Moreover, as an example, in Fig. 3 we show the optimal exercise strategy ${\hat{u}}$ at time $t=0.75$ as a function of the (log) spot price x and of the cumulated quantity z. In the grey region ${\hat{u}} = {\bar{u}}$, while in the white region ${\hat{u}} =0$.

From Fig. 3 it is clear that, unless the spot price is very low, if the cumulated quantity $z < m=0.1$, then it is always optimal to exercise the option, to avoid the penalty. Furthermore, when $ x > 2.5$, equivalently the spot price $e^x$ is bigger than the strike price $ K= \exp (2.5)$ and so the optimal policy consists in exercising the option (i.e., ${\bar{u}} = 1$) whenever $z \in [0,0.25]$. On the other hand, if the spot price is higher than the strike, $x > 2.5$, and if the cumulated quantity satisfies $z>0.25$ then it is not optimal to exercise the option: in the current state $m< z < M$, thus we are not incurring the penalty and the more we have used of our control, the higher the spot price has to be before we are willing to exercise.

We conclude this section by showing in Fig. 4 the candidate optimal hedging strategy ${\hat{h}}^1$ found in Eq. (3.21) as a function of the (log) spot price x and of the cumulated quantity z, at time $t=0.5$.

We notice that, being the UIP increasing in x, $v_x$ is positive on our domain [recall Eq. (3.21)], so that ${\hat{h}}^1$ is always negative: in order to hedge a buyer position in a swing option it is always “optimal” to sell the forward contract. Moreover, for a fixed z, as the (log) spot price increases, the quantity of forward contracts to sell increases. On the other hand, for a fixed x, ${\hat{\pi }}$ is increasing as a function of z, for $z \in [0,0.5]$ (meaning that as the cumulated quantity z increases towards $M=0.5$, selling forward contracts is less and less needed), while ${\hat{h}}^1 =0$ for $z \ge M=0.5$, as expected.

6 Conclusions

In this paper, we considered the problem of pricing and hedging of structured products in energy markets from a buyer’s perspective using the (exponential) utility indifference pricing approach. The main novelty with respect to the existing literature is that buyer has the possibility to trade in the forward market in order to hedge the risk coming from the structured contract.

We characterized the UIP in terms of continuous viscosity solutions of a suitable nonlinear PDE. As a consequence, we were able to identify a candidate for the optimal exercise strategy of the structured product as well as a portfolio strategy partially hedging the financial position.

Moreover, in a more specific setting with two assets and constant correlation, we showed that the UIP equals the value function of an auxiliary simpler optimization problem under a risk neutral probability, that can be interpreted as a perturbation of the minimal entropy martingale measure.

Finally, we provided some numerical applications in the case of swing options. In particular, we computed the UIP price as well as the optimal exercise and hedging strategies for a buyer of one swing option in the linear dynamics model, by solving the corresponding nonlinear PDEs via finite difference schemes. We highlighted the differences with respect to the classical price as in Benth et al. (2012) and discussed some qualitative properties.

Notes

Notice that in our case the function G appearing in the statement of Theorem 4.3.2 in Pham (2009) can be chosen to be any positive number.
All the numerical tests were performed in MATLAB R2015b.

References

Aïd R (2015) Electricity derivatives. Springer, Springer Briefs in Quantitative Finance
Aïd R, Campi L, Langrené N, Pham H (2014) A probabilistic numerical method for optimal multiple switching problems in high dimension. SIAM J Financ Math 5(1):191–231
Article MathSciNet MATH Google Scholar
Barles G, Souganidis PE (1991) Convergence of approximation schemes for fully nonlinear second order equations. Asymptot Anal 4:271–283
MathSciNet MATH Google Scholar
Basei M, Cesaroni A, Vargiolu T (2014) Optimal exercise of swing contracts in energy markets: an integral constrained stochastic optimal control problem. SIAM J Financ Math 5(1):581–608
Article MathSciNet MATH Google Scholar
Becherer D (2003) Rational hedging and valuation of integrated risks under constant absolute risk aversion. Insur Math Con 33:1–28
Becherer D (2004) Utility-indifference hedging and valuation via reaction-diffusion systems. Proc R Soc A 460:27–51
Article MathSciNet MATH Google Scholar
Becherer D, Schweizer M (2005) Classical solutions to reaction-diffusion systems for hedging problems with interacting Itô and point processes. Ann Appl Probab 15(2):1111–1144
Article MathSciNet MATH Google Scholar
Benedetti G, Campi L (2016) Utility indifference valuation for non-smooth payoffs with an application to power derivatives. Appl Math Optim 73(2):349–389
Article MathSciNet MATH Google Scholar
Benth FE, Eriksson MKV (2013) Energy derivatives with volume control. In: Kovacevic R et al (eds) Chapter 16 of Handbook of risk management in energy production and trading. International series in operations research & management science, vol 199. Springer, New York, pp 413–432
Benth FE, Karlsen KH (2005) A note on Merton’s portfolio selection problem for the Schwartz mean-reversion model. Stoch Anal Appl 23(4):687–704
Article MathSciNet MATH Google Scholar
Bouchard B, Touzi N (2011) Weak dynamic programming principle for viscosity solutions. SIAM J Control Optim 49(3):948–962
Article MathSciNet MATH Google Scholar
Benth FE, Lempa J, Nilssen TK (2012) On the optimal exercise of swing options in electricity markets. J Energy Mark 4(4):1–27
Google Scholar
Carmona R, Ludkovski M (2006) Pricing commodity derivatives with basis risk and partial observation, 2006, preprint. http://www.pstat.ucsb.edu/faculty/ludkovski/CarmonaLudkovskiBasis
Carmona R, Ludkovski M (2010) Valuation of energy storage: an optimal switching approach. Quant Finance 10(4):359–374
Article MathSciNet MATH Google Scholar
Carmona R, Touzi N (2008) Optimal multiple stopping and valuation of swing options. Math Finance 18(2):239–268
Article MathSciNet MATH Google Scholar
Cartea A, Villaplana P (2008) Spot price modeling and the valuation of electricity forward contracts: the role of demand and capacity. J Bank Finance 32(12):2502–2519
Article Google Scholar
Chen Z, Forsyth PA (2007) A semi-lagrangian approach for natural gas storage, valuation and optimal operation. SIAM J Sci Comput 30(1):339–368
Article MathSciNet MATH Google Scholar
Da Lio F, Ley O (2006) Uniqueness results for second-order Bellman–Isaacs equations under quadratic growth assumptions and applications. SIAM J Control Optim 45(1):74–106
Article MathSciNet MATH Google Scholar
Davis MH (2006) Optimal hedging with basis risk. In: From stochastic calculus to mathematical finance. Springer, Berlin, pp 169–187
El Karoui N, Rouge R (2000) Pricing via utility maximization and entropy. Math Finance 10(2):259–276
Article MathSciNet MATH Google Scholar
Edoli E, Fiorenzani S, Ravelli S, Vargiolu T (2013) Modeling and valuing make-up clauses in gas swing contracts. Energy Econ 35:58–73
Article Google Scholar
Felix B (2012) Gas storage valuation: a comparative study. EWL working paper N. 01/2012. University of Duisburg-Essen
Fiorenzani S (2006) Pricing illiquidity in energy markets. Energy Risk 65:65–71
Google Scholar
Fleming W, Rishel RW (1975) Deterministic and stochastic optimal control. Springer, New York
Fleming W, Soner HM (1993) Controlled Markov processes and viscosity solutions. Springer, New York
Frittelli M (2000) The minimal entropy martingale measure and the valuation problem in incomplete markets. Math Finance 10:39–52
Article MathSciNet MATH Google Scholar
Henaff P, Laachir I, Russo F (2013) Gas storage valuation and hedging. A quantification of the model risk, 2013, preprint. http://arxiv.org/abs/1312.3789
Henderson V (2002) Valuation of claims on nontraded assets using utility maximization. Math Finance 12:351–373
Article MathSciNet MATH Google Scholar
Henderson V, Hobson D (2009) In: Carmona R (ed) Utility indifference pricing—an overview. Chapter 2 of indifference pricing: theory and applications. Princeton University Press, Princeton
Jaillet P, Ronn EI, Tompaidis S (2004) Valuation of commodity-based swing options. Manag Sci 50(7):909–921
Article MATH Google Scholar
Kushner H (1977) Probability methods for approximations in stochastic control and for Elliptic equations. Academic Press, New York
MATH Google Scholar
Li X, Song QS (2007) Markov Chain approximation methods on generalized HJB equation. In: Proceedings of the 46th IEEE conference on decision and control New Orleans, LA, USA. 12–14 Dec 2007
Liptser R, Shiryaev AN, (1977) Statistics of random processes: I. General theory, vol 5. Springer, New York
Ludkovski M (2008) Financial hedging of operational flexibility. Int J Theor Appl Finance 11(8):799–839
Article MathSciNet MATH Google Scholar
Mijatović A, Urusov M (2012) On the martingale property of certain local martingales. Probab Theory Relat Fields 152(1–2):1–30
Article MathSciNet MATH Google Scholar
Mnif M (2007) Portfolio optimization with stochastic volatilities and constraints: an application in high dimension. Appl Math Optim 56(2):243–264
Article MathSciNet MATH Google Scholar
Monoyios M (2004) Performance of utility-based strategies for hedging basis risk. Quant Finance 4(3):245–255
Article MathSciNet Google Scholar
Pagès G, Bardou O, Bouthemy S (2009) Optimal quantization for the pricing of swing options. Appl Math Finance 16(2):183–217
Article MathSciNet MATH Google Scholar
Pagès G, Bronstein A-L, Wilbertz B (2010) How to speed up the quantization tree algorithm with an application to swing options. Quant Finance 10(9):995–1007
Article MathSciNet MATH Google Scholar
Pham H (2002) Smooth solutions to optimal investment models with stochastic volatilities and portfolio constraints. Appl Math Optim 46(1):55–78
Article MathSciNet MATH Google Scholar
Pham H (2009) Continuous-time stochastic control and optimization with financial applications. Springer, New York
Porchet A, Touzi N, Warin X (2009) Valuation of power plants by utility indifference and numerical computation. Math Methods Oper Res 70(1):47–75
Article MathSciNet MATH Google Scholar
Oberman A, Zariphopoulou T (2003) Pricing early exercise contracts in incomplete markets. CMS 1(1):75–107
Article MATH Google Scholar
Rockafellar RT (1970) Convex analysis. Princeton University Press, Princeton, NJ
Book MATH Google Scholar
Rogers LCG, Williams D (2000) Diffusions, Markov processes and martingales, vol 2. Cambridge University Press, Itô calculus
Book MATH Google Scholar
Schwartz E, Smith JE (2000) Short-term variations and long-term dynamics in commodity prices. Manage Sci 46(7):893–911
Article Google Scholar
Thompson M, Davison M, Rasmussen H (2009) Natural gas storage valuation and optimization: a real option application. Naval Res Logist 56(3):226–238
Article MathSciNet MATH Google Scholar
Valdez ARL, Vargiolu T (2013) Optimal portfolio in a regime-switching model. In: Dalang RC, Dozzi M, Russo E (eds) Proceedings of the Ascona ’11 seminar on stochastic analysis, random fields and applications. Springer, New York, pp 435–449
Warin X (2012) Gas storage hedging. In: Numerical methods in finance. Springer, Berlin, pp 421–445

Download references

Author information

Authors and Affiliations

Department of Mathematics, University of Padova, via Trieste 63, 35121, Padova, Italy
Giorgia Callegaro & Tiziano Vargiolu
Department of Statistics, London School of Economics, Columbia House, 10 Houghton Street, London, WC2A 2AE, UK
Luciano Campi
Phinergy S.r.l.s., Via della Croce Rossa 112, 35129, Padova, Italy
Valeria Giusto

Authors

Giorgia Callegaro
View author publications
You can also search for this author in PubMed Google Scholar
Luciano Campi
View author publications
You can also search for this author in PubMed Google Scholar
Valeria Giusto
View author publications
You can also search for this author in PubMed Google Scholar
Tiziano Vargiolu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Luciano Campi.

Additional information

This work was partly supported by the grant CPDA138873-2013 of the University of Padova “Stochastic models with spatial structure and applications to new challenges in Mathematical Finance, with a focus on the post-2008 financial crisis environment and on energy markets”. Part of this work was done while the first and fourth authors were visiting LSE in November 2013 and while the second author was Visiting Scientist at the University of Padova in June 2014 and July 2015: the financial contribution of the two institutions is kindly acknowledged. Moreover, the authors wish to thank René Aïd, Matt Davison, Enrico Edoli, Paola Mannucci, Mario Putti and the participants to the 2014 ISEFI Conference in Paris and the Conference in Mathematics for Energy Markets in Vienna (2016) for valuable comments.

Appendices

Appendix 1: Proof of Proposition 3.4

The maximisation problem (2.8) fits the setting of Section 5 in the paper Bouchard and Touzi (2011) on weak dynamic programming principle. In particular, their Corollary 5.6 applies. More precisely, the essential ingredients in the proof of Corollary 5.6 are the a-priori estimate (5.2) in Bouchard and Touzi (2011), the local boundedness of the value function and the lower semi-continuity of the objective function in (t, x, y, z) for all admissible controls. First, the a-priori estimate holds due to (2.11). Concerning the local boundedness of the value function, it can be easily checked that in our setting the value function is bounded since it is trivially nonpositive and, being $(u,\pi )=(0,0)$ an admissible strategy, we have

$$\begin{aligned} V(t,x,y,z;q) \ge -\frac{1}{\gamma } \exp \left\{ -\gamma \left[ y+ q \inf _{p \in {\mathbb {R}}} \left( (T-t) L(p,0,0) + \Phi (p,0) \right) \right] \right\} > -\infty \end{aligned}$$

since the functions L and $\Phi $ are bounded (cf. Assumption 2.1). Let $(u,\pi )$ be an admissible given control. Since the control is now fixed, we drop it from the notation of the state variable at maturity and denote them as $A_T ^{t,a}:= (X_T ^{t,x},Y_T ^{t,a}, Z_T ^{t,a})$ with $a =(x,y,z)$, to stress the dependence on the initial data. Now consider the objective function

$$\begin{aligned}{}[0,T] \times {\mathbb {R}}^m \times {\mathbb {R}} \times [0,{\bar{u}}T] \ni (t,x,y,z)=(t,a) \mapsto {\mathbb {E}} [ G(A_T ^{t,a})], \end{aligned}$$

where G is defined in (2.9). From the continuity of the function G and of the state variables $A^{t,a} _T$ with respect to the initial data (t, a), we get that $G(A^{t,a} _T )$ is also continuous in (t, a). Moreover, notice that since L and $\Phi $ are bounded (ref. Assumption 2.1) we have

$$\begin{aligned} | G(A_T ^{t,a}) | \le C \exp \left( -\gamma \left( y + \int _t ^T \left\langle \pi _s , \frac{dF_s}{F_s}\right\rangle \right) \right) , \end{aligned}$$

for some constant $C>0$. Therefore, to prove the lower semi-continuity of the objective function it suffices to show that the family of random variables

$$\begin{aligned} \left\{ \exp \left( -\gamma \int _t ^T \left\langle \pi _s , \frac{dF_s}{F_s}\right\rangle \right) : t \in [0,T] \right\} \end{aligned}$$

is uniformly integrable. We prove that they are bounded in $L^2$ for all admissible controls, i.e.

$$\begin{aligned} \sup _{t\in [0,T]} {\mathbb {E}} \left[ \exp \left( -2\gamma \int _t ^T \left\langle \pi _s , \frac{dF_s}{F_s}\right\rangle \right) \right] < \infty , \end{aligned}$$

which will imply the uniform integrability. Let $\mathcal F_{t,T}$ be the smallest filtration generated by the Brownian increment after t and satisfying the usual conditions. Consider the following change of measure on $\mathcal F_{t,T}$:

$$\begin{aligned} \frac{d{\mathbb {Q}} _t}{d{\mathbb {P}}}:= \exp \left( -2\gamma \int _t ^T \pi ^*_s \sigma ^*_F (s,X^{t,x}_s) dW_s - 2\gamma ^2 \int _t ^T | \pi ^*_s \sigma ^*_F(s,X^{t,x}_s) | ^2 ds \right) , \end{aligned}$$

(7.1)

which is well defined. Indeed, the boundedness of $\sigma _F^* \sigma _F$ (cf. Assumption 3.2 (iii)) and the admissibility property (2.5) imply that $\sup _{t\le s\le T} {\mathbb {E}} [ \exp (\varepsilon | \pi ^*_s \sigma _F ^*(s,X_s^{t,x}) |^2)] < \infty $ for some $\varepsilon >0$, hence the criterion in Liptser and Shiryaev (1977), Example 3, Sect. 6.2.3 is fulfilled. Moreover, the change of measure (7.1) satisfies $\sup _{t\in [0,T]} {\mathbb {E}} [ (d{\mathbb {Q}}_t / d{\mathbb {P}})^2 ] < \infty $. This is a consequence of the admissibility of $\pi $ as in (2.5). Indeed,

$$\begin{aligned} \frac{d{\mathbb {Q}}_t}{d{\mathbb {P}}} \le \exp \left( -2\gamma \int _t ^T \pi ^*_s \sigma ^*_F (s,X^{t,x}_s) dW_s \right) , \end{aligned}$$

giving that

$$\begin{aligned} {\mathbb {E}}\left[ \left( \frac{d{\mathbb {Q}}_t}{d{\mathbb {P}}}\right) ^2 \right]\le & {} {\mathbb {E}} \left[ \exp \left( -4\gamma \int _t ^T \pi ^*_s \sigma ^*_F (s,X^{t,x}_s) dW_s \right) \right] \\\le & {} {\mathbb {E}}\left[ \exp \left( -8\gamma \int _t ^T \pi ^*_s \sigma ^*_F (s,X^{t,x}_s) dW_s - 2\delta \int _t ^T | \pi ^*_s \sigma ^*_F(s,X^{t,x}_s) | ^2 ds \right) \right] ^{1/2} \\&\times \, {\mathbb {E}}\left[ e^{2\delta \int _t ^T | \pi ^*_s \sigma ^*_F(s,X^{t,x}_s) | ^2 ds}\right] ^{1/2} \\= & {} {\mathbb {E}}\left[ e^{2\delta \int _0 ^T | \pi ^*_s \sigma ^*_F(s,X^{t,x}_s) | ^2 ds}\right] ^{1/2}, \end{aligned}$$

with $\delta $ such that $2\delta = (8\gamma )^2 /2$, since the first exponential in the second inequality above is a true martingale. Moreover, since $\pi $ is admissible we have ${\mathbb {E}}\left[ e^{2\delta \int _0 ^T | \pi ^*_s \sigma ^*_F(s,X^{t,x}_s) | ^2 ds}\right] < \infty $. As a consequence, we obtain that $d{\mathbb {Q}}_t / d{\mathbb {P}}$ is square integrable.

Therefore we have

$$\begin{aligned}&{\mathbb {E}} \left[ \exp \left( -2\gamma \int _t ^T \left\langle \pi _s , \frac{dF_s}{F_s}\right\rangle \right) \right] \\&\quad = {\mathbb {E}}_{{\mathbb {Q}}_t} \left[ \exp \left( -2\gamma \int _t ^T \pi ^* _s \mu _F(s,X^{t,x}_s) ds + 2\gamma ^2 \int _t ^T | \pi _s ^* \sigma ^* _F (s,X^{t,x}_s) |^2 ds \right) \right] \\&\quad \le {\mathbb {E}} \left[ \left( \frac{d{\mathbb {Q}}_t}{d{\mathbb {P}}} \right) ^2 \right] {\mathbb {E}} \left[ \exp \left( -4\gamma \int _t ^T \pi ^* _s \mu _F(s,X^{t,x}_s) ds + 4\gamma ^2 \int _t ^T | \pi _s ^* \sigma ^* _F (s,X^{t,x}_s) |^2 ds \right) \right] . \end{aligned}$$

Using the linear growth condition of $\mu _F$ and the boundedness of $\sigma _F ^* \sigma _F$ [cf. Assumption 3.2 (ii) and (iii)], we have

$$\begin{aligned}&\exp \left( -4\gamma \int _t ^T \pi ^* _s \mu _F(s,X^{t,x}_s) ds + 4\gamma ^2 \int _t ^T | \pi _s ^* \sigma ^* _F (s,X^{t,x}_s) |^2 ds \right) \\&\quad \le \exp \left( \int _0 ^T \left( c_1 | \pi _s | + c_2 |\pi _s|^2 + c_3 |X_s|^2 \right) ds \right) , \end{aligned}$$

for some positive constants $c_1,c_2,c_3$. To conclude it suffices to prove that the RHS above is integrable for ${\mathbb {P}}$. This follows from the admissibility of $\pi $ as in (2.5) and the exponential uniform bound (2.12) for X.

Finally, even though the space of admissible controls in our setting is smaller than the one in Bouchard and Touzi (2011), the value functions are the same since any controls in their space $\mathcal U_0$ can be clearly approximated by admissible controls in $\mathcal A$ through truncation. The result follows.

Appendix 2: Regularity properties of the log-value function

In order to prove the next lemma we follow closely the approach in Pham (2002), which has also been used in Mnif (2007) in a slightly different model with stochastic volatility with jumps and for an agent with exponential utility. Since the proof mimicks closely the arguments in Pham (2002), we only sketch them pointing out the main differences.

Lemma 8.1

Let $q\ge 0$. Let Assumptions 2.1 and 2.5 hold. Under Assumption 3.2 the log-value function J(t, x, z; q) defined as in (3.1) has quadratic growth in (x, z) uniformly in t.

Proof

Since the claim $C^u _{t,T}$ is bounded in (x, z) uniformly in the controls u (cf. Assumption 2.1), it suffices to prove that $J^0 (t,x)$, the log-value function of the pure investment problem, has quadratic growth in x uniformly in t.

First of all, repeating exactly the same arguments as in the proof of Theorem 3.1 in Pham (2002), we get that if the PDE (3.10) with terminal condition $J^0(T,x) = \frac{\log \gamma }{\gamma }$ admits a unique solution belonging to $C^{1,2}([0,T) \times {\mathbb {R}}^m)\cap C^0 ([0,T]\times \mathbb R^m)$, whose x-derivative has linear growth, then such a solution coincides with $J^0 (t,x)$.

To conclude the proof, we need to show that the PDE (3.10) has a unique smooth solution as above, whose x-derivative has linear growth. We adapt to our setting the arguments in the proof of Pham (2002), Th. 4.1 under his Assumptions (H3a). Indeed, notice that our Assumption 3.2(i), together with the Lipschitz continuity of b postulated in Assumption 2.5 (ii), corresponds to $\mathbf (H3a) (i)$ in Pham (2002). Moreover Assumption 3.2 (ii) implies $\mathbf (H3a) (ii)$, while Assumption 3.2 (iii) guarantees $\mathbf (H2) (b)$ [see Remark 2.3 in Pham 2002].

Consider the PDE (3.17) in the case $q=0$, with F(w) replaced by

$$\begin{aligned} F_k (w):= \inf _{\alpha \in \mathcal B_k} \left\{ -{\tilde{F}} (\alpha ) - \langle \alpha , w \rangle \right\} , \quad w\in {\mathbb {R}}^m , \end{aligned}$$

(8.1)

where $\mathcal B_k$ is the centered ball in ${\mathbb {R}}^m$ with radius $k\ge 1$. Recall that ${\tilde{F}}$ is the convex conjugate of F and that is given by

$$\begin{aligned} {\tilde{F}}(\alpha ) = -\frac{1}{2} \langle \alpha , B^{-1} \alpha \rangle , \quad \alpha \in \text {Im}(B), \end{aligned}$$

while it equals $-\infty $ otherwise. Proceeding as in the proof of Pham (2002), Th. 4.1, we can apply Theorem 6.2 in Fleming and Rishel (1975), giving the existence of a unique solution $J^{0,k} \in C^{1,2}([0,T) \times {\mathbb {R}}^m)\cap C^0 ([0,T]\times {\mathbb {R}}^m)$ with polynomial growth in x, for the parabolic PDE

$$\begin{aligned} \begin{array}{c} \displaystyle J^{0,k}_t + \frac{1}{2 \gamma } \langle (\sigma _F^* \sigma _F)^{-1} \mu _F,\mu _F\rangle +\gamma F_k (J^{0,k}_x) + \frac{1}{2} \text {tr} \left( \Sigma ^* \Sigma J^{0,k}_{xx}\right) = 0, \end{array} \end{aligned}$$

(8.2)

with terminal condition $J^{0,k} (T,x) = \frac{\log \gamma }{\gamma }$. Notice that the convex conjugate ${\tilde{F}}$ of F, appearing in the definition of $F_k (w)$ in (8.1), can take the value $-\infty $, which is not a problem here since this value does not contribute to the infimum over $\alpha $.

The next step consists, as in Pham (2002), in using a stochastic control representation of the solution $J^{0,k}$ to derive a uniform bound on the derivative, independently of the approximation. Indeed, from standard verification arguments we get that

$$\begin{aligned} J^{0,k}(t,x) = \inf _{\alpha \in {\mathbb {B}}_k} \mathbb E^{\mathcal Q} \left[ \int _t ^T \Lambda (s, X_s , \alpha _s ) ds \mid X_t =x\right] , \end{aligned}$$

where

$$\begin{aligned} \Lambda (s,x,\alpha ) = \frac{1}{2 \gamma } \langle (\sigma _F^* \sigma _F)^{-1} \mu _F, \mu _F\rangle (s,x) - \gamma {\tilde{F}} (\alpha ), \end{aligned}$$

where ${\mathbb {B}}_k$ is the set of ${\mathbb {R}}^m$-valued adapted processes $\alpha $ bounded by k, and the controlled dynamics of X under $\mathcal Q$ is given by

$$\begin{aligned} dX_s = ({\bar{b}}(s,X_s) - \gamma \alpha _s ) ds + \Sigma ^* (s,X_s) dW^{\mathcal Q}_s , \end{aligned}$$

where $W^{\mathcal Q}$ is a d-dimensional Brownian motion under $\mathcal Q$ and ${\bar{b}}$ has been defined in (3.7). Notice that, since $\Lambda $ takes the value $-\infty $ outside the image of B, then the optimal Markov control evaluated along the optimal path ${\hat{\alpha }} (s,{\hat{X}}_s)$ will lie on $\mathrm {Im} (B)$ a.s. for every $s\in [t,T]$. We can use Lemma 11.4 in Fleming and Soner (1993) and the same estimates as in Pham (2002), Lemma 4.1 to obtain

$$\begin{aligned} | J_x ^{0,k} (t,x) | \le C (1+ |x|) , \quad \forall (t,x) \in [0,T]\times {\mathbb {R}}^m , \end{aligned}$$

for some positive constant C, which does not depend on k. Now we argue as in the proof of Pham (2002), Th. 4.1, Case (H3a), to deduce that $|{\hat{\alpha }}_k (t,x)| \le C$ for all $t\in [0,T]$ and $|x| \le M$ for some positive constant C (independent of k) and an arbitrarily large $M>0$. Therefore, we get that, for $k \le C$, $F_k (J^{0,k}_x) = F(J^{0,k}_x)$ for all $(t,x) \in [0,T]\times \mathcal B_M$. Letting M tend to $+\infty $, we finally get that $J^{0,k}$ is a smooth solution with linear growth on derivative to the PDE (3.17) (with $q=0$). To conclude, we have that $J^0 = J^{0,k}$ for k sufficiently large, giving, in particular, that $J^0$ has quadratic growth in x uniformly in t. Therefore the proof is complete. $\square $

Appendix 3: Convergence of the numerical scheme

In this section we show that the value function obtained from the finite difference scheme converges to v. We will follow an approach originally developed by Kushner (1977) and based on stochastic control theory, which specifically requires that the finite difference scheme has a Markov chain interpretation.

First of all we notice that Eq. (4.11) can be written in the form of a Bellman–Isaacs equation, as done in the proof of Theorem 3.3 part (i):

$$\begin{aligned} \begin{array}{c} \displaystyle v_t + \inf _{\alpha \in \mathbb {R}} \ \sup _{u \in [0,{\bar{u}}]} \left\{ b^\alpha (t,x) v_x + \frac{1}{2} \sigma ^2 v_{xx} + u v_z+ q L + \frac{\gamma \alpha ^2}{2 \sigma ^2 (1 - \rho ^2)} \right\} = 0, \end{array} \end{aligned}$$

(9.1)

with

$$\begin{aligned} b^\alpha (t,x):= \delta (\theta - x) - \rho \frac{\sigma }{{\bar{\sigma }}_F} (a - k x) - \gamma \sigma ^2 (1 - \rho ^2) \left[ \beta (t) + 2 \Gamma (t) x \right] - \gamma \alpha , \end{aligned}$$

where $\beta (t)$ and $\Gamma (t)$ can be computed explicitly as solutions of the system of ODEs (4.10).

We now want to use the results in Barles and Souganidis (1991), or equivalently in Fleming and Soner (1993), Ch. IX (in the spirit of Li and Song (2007)), which work well when the min-max is taken on compact sets. To do this, we approximate $\inf _{\alpha \in \mathbb {R}}$ by $\inf _{\alpha \in \mathcal B_R}$, where $\mathcal B_R:= [-R,R]$, $R\ge 1$ (we will eventually let R go to $+ \infty $), obtaining a finite-difference approximation of the form [see the analogous equation (3.26) in Fleming and Soner (1993), Ch. IX]

$$\begin{aligned} v^{n}_{i,j}= & {} \inf _{\alpha \in \mathcal B_R} \sup _{u \in [0,{\bar{u}}]} \left\{ p^{1,n}_{\alpha ,u;i,j} v^{n+1}_{i+1,j} + p^{2,n}_{\alpha ,u;i,j} v^{n+1}_{i,j} + p^{3,n}_{\alpha ,u;i,j} v^{n+1}_{i-1,j} \right. \nonumber \\&\quad \left. +\, p^{4,n}_{\alpha ,u;i,j} v^{n+1}_{i,j+1} + \Delta t \ L_{\alpha ,u;i,j} \right\} \end{aligned}$$

(9.2)

where

$$\begin{aligned} p^{1,n}_{\alpha ,u;i,j}:= & {} \frac{\sigma ^2 \Delta t}{2 (\Delta x)^2} + \frac{\Delta t}{2 \Delta x} b^\alpha (t_n,x_i), \\ p^{2,n}_{\alpha ,u;i,j}:= & {} 1 - \frac{\sigma ^2 \Delta t}{(\Delta x)^2} - u \frac{\Delta t}{\Delta z}, \\ p^{3,n}_{\alpha ,u;i,j}:= & {} \frac{\sigma ^2 \Delta t}{2 (\Delta x)^2} - \frac{\Delta t}{2 \Delta x} b^\alpha (t_n,x_i), \\ p^{4,n}_{\alpha ,u;i,j}:= & {} u \frac{\Delta t}{\Delta z}, \\ L_{\alpha ,u;i,j}:= & {} L(e^{x_i},z_j,u). \end{aligned}$$

Notice that above quantities can be interpreted as the one-step transition probabilities of, respectively, going up, nowhere or down in x and up in z, when at time $t_n$ the processes (X, Z) is in the state $(x_i,z_j)$: more explicitly, e.g.,

$$\begin{aligned} p^{1,n}_{\alpha ,u;i,j} = {\mathbb {P}} (X_{t_{n+1}} = x_{i+1}, Z_{t_{n+1}}=z_{j} | X_{t_{n+1}} = x_{i}, Z_{t_{n+1}}=z_{j}), \end{aligned}$$

and the other ones can be written analogously. Hence, we are dealing with a Markov chain approximation of the state variable. Notice that the sum of the above four probabilities is equal to one.

For this Markov chain approximation to be rigorous, we must impose that $p^{i,n}_{\alpha ,u;i,j} \in [0,1]$ for every $i \in \{1,2,3,4 \}$ and for $n \in \{ 0, \dots , N-1 \}$ and for all possible states i, j and controls $u,\alpha $.

Taking into account that the domain in (x, z) is bounded and $\alpha ,u$ are also taken to be valued in a compact domain, the two conditions $p^{1,n}_{\alpha ,u;i,j} \le 1$ and $p^{3,n}_{\alpha ,u;i,j} \le 1$ are satisfied as soon as $\Delta t$ is small enough, since they read, respectively:

$$\begin{aligned} \frac{\sigma ^2 }{2 (\Delta x)^2} + \frac{b^\alpha (t_n,x_i)}{2 \Delta x} \le \frac{1}{\Delta t} , \quad \frac{\sigma ^2 }{2 (\Delta x)^2} - \frac{b^\alpha (t_n,x_i)}{2 \Delta x} \le \frac{1}{\Delta t} , \end{aligned}$$

while imposing $p^{1,n}_{\alpha ,u;i,j} \ge 0$ and $ p^{3,n}_{\alpha ,u;i,j} \ge 0$ yields

$$\begin{aligned} \vert b^\alpha (t_n,x_i) \vert \le \frac{\sigma ^2 }{\Delta x} , \end{aligned}$$

which has to hold true for every control $\alpha $, every n and every state $x_i$, so that we find

$$\begin{aligned} \Delta x \le \frac{\sigma ^2}{\sup _\alpha \Vert b^\alpha \Vert _\infty } . \end{aligned}$$

(9.3)

Moreover $p^{2,n}_{\alpha ,u;i,j}$ is always smaller than 1, while asking its non-negativity gives as necessary and sufficient condition

$$\begin{aligned} \frac{1}{\Delta t} \ge \frac{\sigma ^2}{(\Delta x)^2} + \frac{{\overline{u}}}{\Delta z}, \end{aligned}$$

(9.4)

which implies the well-known Courant-Friedrichs-Lewy condition $ {\overline{u}} \Delta t \le \Delta z $ [also present in Benth et al. (2012)] implying in turn $p^{4,n}_{\alpha ,u;i,j} \le 1$. Finally, $p^{4,n}_{\alpha ,u;i,j} $ is always positive. We are now ready to verify the conditions of monotonicity, stability and consistency required by the framework in Barles and Souganidis (1991), which correspond to assumptions (4.3)–(4.6) in Fleming and Soner (1993), Ch. IX. We proceed as done in Fleming and Soner (1993), Ch. IX, Example 4.1. The monotonicity is automatically given by the Markov chain interpretation in Eq. (9.2). The stability is implied by the same equation (which also has a unique solution) and by the fact that L and $\Phi $ are bounded on the bounded domain: in fact, one can easily prove by backward induction on n that

$$\begin{aligned} |v^{n}_{i,j}| \le \sup _{\alpha \in \mathcal B_R} \sup _{u \in [0,{\bar{u}}]} (T - t_n) \Vert L_{\alpha ,u} \Vert _\infty + \Vert \Phi \Vert _\infty \le \sup _{u \in [0,{\bar{u}}]} T \Vert L(\cdot ,\cdot ,u) \Vert _\infty + \Vert \Phi \Vert _\infty , \end{aligned}$$

(9.5)

where in the last inequality we have used the fact the $L_{\alpha ,u}$ does not depend on $\alpha $ and where the upper bound is uniform in i, j and in the discretization step $\Delta t$. Finally, the consistency property holds because the finite differences converge uniformly on compact sets to the corresponding derivatives (see, e.g., Fleming and Soner 1993, Theorem 4.2 and remember that we are working under the linear dynamics model and in the case of swing contracts). So we now have the solution $ v_R$ to Equation (9.1) where $\inf _{\alpha \in \mathbb {R}}$ is replaced by $\inf _{\alpha \in \mathcal B_R}$. Because of the stochastic game interpretation of this equation [as in Barles and Souganidis (1991)], letting $R \rightarrow + \infty $ gives that the sequence ${(v_R)}_{R \ge 1}$ decreases pointwise and, by Eq. (9.5), it is bounded uniformly in $\alpha $ (here $L_{\alpha ,u}$ does not depend on $\alpha $). Thus it admits a finite limit, v, which is the solution to Eq. (9.1).

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Callegaro, G., Campi, L., Giusto, V. et al. Utility indifference pricing and hedging for structured contracts in energy markets. Math Meth Oper Res 85, 265–303 (2017). https://doi.org/10.1007/s00186-016-0569-6

Download citation

Received: 20 February 2016
Accepted: 23 December 2016
Published: 04 February 2017
Issue Date: April 2017
DOI: https://doi.org/10.1007/s00186-016-0569-6

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Utility indifference pricing and hedging for structured contracts in energy markets

Abstract

Similar content being viewed by others

Duality in optimal consumption–investment problems with alternative data

A stochastic Asset Liability Management model for life insurance companies

A Multiscale study of flexible customer’s energy demand under smart grid architecture: A modeling and simulation study

1 Introduction

2 Formulation of the problem

2.1 Structured products

Assumption 2.1

Example 2.2

Example 2.3

Remark 2.4

2.2 The market model

Assumption 2.5

2.3 Admissible strategies and utility indifference price

Definition 2.6

Definition 2.7

Remark 2.8

Remark 2.9

Remark 2.10

3 Characterization of the UIP with viscosity solutions

3.1 Heuristics on the value function PDE

Remark 3.1

3.2 Existence and uniqueness results

Assumption 3.2

Theorem 3.3

Proposition 3.4

Proof of Theorem 3.3

Remark 3.5

Remark 3.6

4 Examples

4.1 A class of models with two assets and constant correlation

Assumption 4.1

Assumption 4.2

Remark 4.3

Lemma 4.4

Proposition 4.5

Proof

Example 4.6

4.2 The Cartea–Villaplana model with correlation

Assumption 4.7

4.2.1 The case of one forward contract

4.2.2 The case of two forward contracts

5 Numerical results

5.1 Comparison with the results in Benth et al. (2012)

Remark 5.1

5.1.1 Numerical results

5.2 A more realistic example

6 Conclusions

Notes

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Appendices

Appendix 1: Proof of Proposition 3.4

Appendix 2: Regularity properties of the log-value function

Lemma 8.1

Proof

Appendix 3: Convergence of the numerical scheme

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation