
1 Introduction

The purpose of this paper is to introduce and study a pricing model where beliefs about the future development of the price process influence its current dynamics. We think this can be a realistic assumption in price dynamics where human psychology is involved, for example in electricity prices, oil prices and energy markets in general. It can also be a natural model of the risky asset price in an insider influenced market. See Sect. 5.1.

We model such price processes as backward stochastic differential equations (BSDEs) driven by Brownian motion and a compensated Poisson random measure, where the coefficients depend not only on the current values of the unknown processes, but also on their predicted future values. These predicted values are expressed mathematically in terms of conditional expectation, and we therefore name such equations predictive mean-field equations. To the best of our knowledge such systems have never been studied before.

In applications to portfolio optimization in a financial market where the price process is modeled by a predictive mean-field equation, we are led to consider coupled systems of forward-backward stochastic differential equations (FBSDEs), where the BSDE is of predictive mean-field type. In this paper we study solution methods for the optimal control of such systems in terms of maximum principles. Then we apply these methods to study

  1. (i)

    optimal portfolio in a financial market with an insider influenced asset price process (Sect. 5.1), and

  2. (ii)

    optimal consumption rate from a cash flow modeled as a geometric Itô-Lévy SDE, with respect to predictive recursive utility (Sect. 5.2).

2 Formulation of the Problem

We now present our model in detail. We refer to [5] for information about stochastic control of jump diffusions.

Let \(B(t) = B(t,\omega ); \; (t,\omega ) \in [0, \infty ) \times \varOmega \) and \(\tilde{N}(dt, d\zeta ) = N(dt, d \zeta ) - \nu (d\zeta )dt\) be a Brownian motion and an independent compensated Poisson random measure, respectively, on a filtered probability space \(\left( \varOmega , \mathscr {F}, \mathbb {F}= \{ \mathscr {F}_t\}_{t \ge 0}, P\right) \) satisfying the usual conditions. We consider a controlled system of predictive (time-advanced) coupled mean-field forward-backward stochastic differential equations (FBSDEs) of the form (\(T > 0\) and \(\delta >0\) are given constants):

  • Forward SDE in X(t):

    $$\begin{aligned} {\left\{ \begin{array}{ll} dX(t) &{}= dX^{u}(t)= b(t,X(t),Y(t), A(t), Z(t),K(t,\cdot ), u(t),\omega )dt\\ &{} \quad + \sigma (t,X(t),Y(t),A(t), Z(t),K(t,\cdot ),u(t),\omega )dB(t) \\ &{} \quad + \int _{\mathbb {R}} \gamma (t,X(t),Y(t),A(t), Z(t),K(t,\cdot ),u(t),\zeta ,\omega )\tilde{N}(dt,d\zeta )\; ; \; t \in [0,T] \\ X(0)&{}=x \in \mathbb {R}\end{array}\right. } \end{aligned}$$
  • Predictive BSDE in Y(t), Z(t), K(t):

    $$\begin{aligned} {\left\{ \begin{array}{ll} dY(t) &{}= -g(t, X(t),Y(t), A(t), Z(t),K(t,\cdot ),u(t), \omega )dt + Z(t) dB(t) \\ &{} \quad + \int _\mathbb {R}K(t, \zeta ) \tilde{N}(dt, d\zeta ) \; ; \; t \in [0,T) \\ Y(T) &{} = h(X(T),\omega ). \end{array}\right. } \end{aligned}$$
    (1)

    We set

    $$\begin{aligned} Y(t):= L\; ; \; t \in (T, T + \delta ] , \end{aligned}$$
    (2)

    where L is a given bounded \(\mathscr {F}_T\)-measurable random variable, representing a “cemetery” state of the process Y after time T. The process A(t) represents our predictive mean-field term. It is defined by

    $$\begin{aligned} A(t) : = E[Y(t + \delta ) \mid \mathscr {F}_t ] \; ; \; t \in [0, T]. \end{aligned}$$
    (3)
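To see what the conditional expectation in (3) amounts to numerically, the following is a small Monte Carlo sketch (our illustration, not part of the model): we take the explicitly solvable toy choice \(Y(s) = B(s)^2 - s\), which is a martingale, so that \(A(t) = E[Y(t+\delta ) \mid \mathscr {F}_t] = Y(t)\) exactly and the estimate can be checked against a closed form. All numerical values are arbitrary.

```python
import numpy as np

# Toy illustration: estimate the predictive mean-field term
# A(t) = E[Y(t+delta) | F_t] by Monte Carlo, for the explicitly
# solvable choice Y(s) = B(s)^2 - s, a martingale, so that
# A(t) = Y(t) exactly and the estimate can be checked.
rng = np.random.default_rng(0)
t, delta = 1.0, 0.25
B_t = 0.7          # observed value of the Brownian motion at time t
n_paths = 200_000

# Conditional on F_t, B(t+delta) = B_t + sqrt(delta) * N(0,1).
B_future = B_t + np.sqrt(delta) * rng.standard_normal(n_paths)
A_estimate = np.mean(B_future**2 - (t + delta))

print(A_estimate)      # Monte Carlo estimate of A(t)
print(B_t**2 - t)      # exact value Y(t) = B_t^2 - t, approximately -0.51
```

For a general driver g the law of \(Y(t+\delta )\) is of course not available in closed form; this sketch only illustrates the meaning of the conditioning in (3).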

Here \(\mathscr {R}\) is the set of functions from \(\mathbb {R}_0 := \mathbb {R}\backslash \{0\}\) into \(\mathbb {R}\), \(h(x,\omega )\) is a \(C^1\) function (with respect to x) from \(\mathbb {R}\times \varOmega \) into \(\mathbb {R}\) such that \(h(x,\cdot )\) is \(\mathscr {F}_T\)-measurable for all x, and

$$g : [0,T] \times \mathbb {R}\times \mathbb {R}\times \mathbb {R}\times \mathbb {R}\times \mathscr {R} \times \mathbb {U}\times \varOmega \rightarrow \mathbb {R}$$

is a given function (driver) such that \(g(t,x,y,a,z,k,u,\cdot )\) is an \(\mathbb {F}\)-adapted process for all \(x,y,a,z \in \mathbb {R}\), \(k\in \mathscr {R}\) and \(u \in \mathbb {U}\), where \(\mathbb {U}\) is the set of admissible control values. The process u(t) is our control process; it is assumed to belong to a given family \(\mathscr {A}= \mathscr {A}_{\mathbb {G}}\) of admissible processes, required to be càdlàg and adapted to a given subfiltration \(\mathbb {G}= \{ \mathscr {G}_t\}_{t \ge 0}\) of the filtration \(\mathbb {F}\), i.e. \(\mathscr {G}_t \subseteq \mathscr {F}_t\) for all t. The sigma-algebra \(\mathscr {G}_t\) represents the information available to the controller at time t.

We assume that for all \(u \in \mathscr {A}\) the coupled system (1)–(3) has a unique solution \(X(t)=X^{u}(t)\in L^{2}(m \times P),Y(t)=Y^{u}(t)\in L^{2}(m \times P), A(t)=A^{u}(t)\in L^{2}(m \times P),Z(t)=Z^{u}(t)\in L^{2}(m \times P),K(t,\zeta )=K^{u}(t,\zeta )\in L^{2}(m \times \nu \times P)\), with X(t), Y(t), A(t) being càdlàg and \(Z(t), K(t,\zeta )\) being predictable. Here and later m denotes Lebesgue measure on [0, T].

To the best of our knowledge this system, (1)–(3), of predictive mean-field FBSDEs has not been studied before. However, the predictive BSDE (1)–(3) is related to the time-advanced BSDE which appears as an adjoint equation for stochastic control problems of a stochastic differential delay equation. See [7] and the references therein.

The process A(t) models the predicted future value of the state Y at time \(t + \delta \). Therefore (1)–(3) represent a system where the dynamics of the state is influenced by beliefs about the future. This is a natural model for situations where human behavior is involved, for example in pricing issues in financial or energy markets.

The performance functional associated to \(u \in \mathscr {A}\) is defined by

$$\begin{aligned} J(u) = E \left[ \int _0^{T} f(t,X(t),Y(t), A(t), u(t),\omega )dt + \varphi (X(T),\omega )+\psi (Y(0)) \right] \end{aligned}$$
(4)

where \(f : [0,T] \times \mathbb {R}\times \mathbb {R}\times \mathbb {R}\times \mathbb {U}\times \varOmega \rightarrow \mathbb {R}\), \(\varphi : \mathbb {R}\times \varOmega \rightarrow \mathbb {R}\) and \(\psi : \mathbb {R}\rightarrow \mathbb {R}\) are given \(C^{1}\) functions, with \(f(t,x,y,a,u,\cdot )\) being \(\mathbb {F}\)-adapted for all \(x,y,a \in \mathbb {R}\), \(u \in \mathbb {U}\). We assume that \(\varphi (x,\cdot )\) is \(\mathscr {F}_{T}\)-measurable for all x.

We study the following predictive mean-field stochastic control problem:

Find \(u^* \in \mathscr {A}\) such that

$$\begin{aligned} \sup _{u \in \mathscr {A}} J(u) = J(u^*). \end{aligned}$$
(5)

In Sect. 3 we give a sufficient and a necessary maximum principle for the optimal control of forward-backward predictive mean-field systems of the type above.

An existence and uniqueness result for predictive mean-field BSDEs is given in Sect. 4.

Then in Sect. 5 we apply the results to the following problems:

  • Portfolio optimization in a market where the stock price is modeled by a predictive mean-field BSDE,

  • Optimization of consumption with respect to predictive recursive utility.

3 Solution Methods for the Stochastic Control Problem

3.1 A Sufficient Maximum Principle

For notational simplicity we suppress the dependence of \(\omega \) in \(f,g,h,\varphi \) and \(\psi \) in the sequel. We first give sufficient conditions for optimality of the control u by modifying the stochastic maximum principle given in, for example, [6], to our new situation:

We define the Hamiltonian \(H : [0,T] \times \mathbb {R}\times \mathbb {R}\times \mathbb {R}\times \mathbb {R}\times \mathscr {R}\times \mathbb {U}\times \mathbb {R}\times \mathbb {R}\times \mathscr {R}\times \mathbb {R}\rightarrow \mathbb {R}\) associated to the problem (5) by

$$\begin{aligned} H(t,x,y,a,z,k,u,p,q,r,\lambda )&= f(t,x,y,a,u)+b(t,x,y,a,z,k,u)p+\sigma (t,x,y,a,z,k,u)q\nonumber \\&\quad +\int _{\mathbb {R}}\gamma (t,x,y,a,z,k,u,\zeta )r(\zeta )\nu (d\zeta ) + g(t,x,y,a,z,k,u)\lambda . \end{aligned}$$
(6)

We assume that \(f,b,\sigma ,\gamma \) and g, and hence H, are Fréchet differentiable \((C^1)\) in the variables \(x,y,a,z,k,u\) and that the Fréchet derivative \(\nabla _k H\) of H with respect to \(k \in \mathscr {R}\) as a random measure is absolutely continuous with respect to \(\nu \), with Radon-Nikodym derivative \(\displaystyle \frac{d \nabla _k H}{d \nu }\). Thus, if \(\langle \nabla _k H, h \rangle \) denotes the action of the linear operator \(\nabla _k H\) on the function \(h \in \mathscr {R}\) we have

$$\begin{aligned} \langle \nabla _k H, h \rangle = \int _\mathbb {R}h(\zeta ) d \nabla _k H(\zeta ) = \int _\mathbb {R}h(\zeta ) \frac{d \nabla _kH(\zeta )}{d \nu (\zeta )} d \nu (\zeta ). \end{aligned}$$
(7)
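As a concrete check of (7), here is a discrete sketch (ours; the weights and coefficients are arbitrary illustrations): when H depends on k only through a term \(\int _{\mathbb {R}} \gamma (\zeta ) k(\zeta ) r(\zeta ) \nu (d\zeta )\), the density is \(\frac{d \nabla _k H}{d \nu } = \gamma r\), and the Gateaux derivative of H in a direction h agrees with the integral on the right-hand side of (7). We verify this on a finite approximation of the Lévy measure \(\nu \).

```python
import numpy as np

# Discrete sketch of (7): H depends on k through
# sum_i gamma_i * k_i * r_i * nu_i (a finite approximation of
# int gamma(zeta) k(zeta) r(zeta) nu(d zeta)), so the density
# d nabla_k H / d nu equals gamma * r pointwise.
zeta = np.array([-1.0, -0.5, 0.5, 1.0])   # jump sizes (support of nu)
nu = np.array([0.2, 0.5, 0.5, 0.2])       # toy weights nu({zeta_i})
gamma = 0.3 * zeta                        # illustrative coefficient
r = np.ones_like(zeta)                    # illustrative adjoint values

def H_k_part(k):
    """The part of H that depends on k (discretized integral)."""
    return np.sum(gamma * k * r * nu)

h = np.array([0.1, -0.2, 0.4, 0.3])       # direction in which we differentiate
density = gamma * r                       # d nabla_k H / d nu

# Numerical Gateaux derivative at k0 = 0 versus the formula in (7).
eps = 1e-6
k0 = np.zeros_like(zeta)
numeric = (H_k_part(k0 + eps * h) - H_k_part(k0)) / eps
exact = np.sum(h * density * nu)
print(numeric, exact)   # the two values agree
```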

The associated backward-forward system of equations in the adjoint processes \(p(t),q(t),r(t),\lambda (t)\) is defined by

  • BSDE in p(t), q(t), r(t):

    $$\begin{aligned} {\left\{ \begin{array}{ll} dp(t) &{} = \displaystyle - \frac{\partial H}{\partial x}(t)dt + q(t)dB(t) + \int _\mathbb {R}r(t,\zeta ) \tilde{N}(dt, d\zeta ) \; ; \; 0 \le t \le T \\ p(T) &{} = \varphi '(X(T)) + \lambda (T) h'(X(T)). \end{array}\right. } \end{aligned}$$
    (8)
  • SDE in \(\lambda (t)\):

    $$\begin{aligned} {\left\{ \begin{array}{ll} d\lambda (t) \displaystyle &{}= \left\{ \frac{\partial H}{\partial y}(t) + \frac{\partial H}{\partial a}(t-\delta ) \chi _{[\delta ,T]}(t) \right\} dt + \frac{\partial H}{\partial z}(t) dB(t) \\ &{} \qquad + \displaystyle \int _\mathbb {R}\frac{d \nabla _k H}{d \nu }(t,\zeta ) \tilde{N}(dt, d\zeta ) \; ; \; 0 \le t \le T \\ \lambda (0)&{} = \psi '(Y(0)), \end{array}\right. } \end{aligned}$$
    (9)

where we have used the abbreviated notation

$$\begin{aligned} H(t)&= H(t, X(t),Y(t), A(t),Z(t),K(t,\cdot ),u(t),p(t),q(t),r(t),\lambda (t)). \end{aligned}$$

Note that, in contrast to the time-advanced BSDE (1)–(3), (9) is a (forward) stochastic differential equation with delay.
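To illustrate this remark, the following is a minimal Euler-Maruyama sketch (our illustration; the constants a, b, c stand in for the partial derivatives of H, which in (9) are stochastic coefficients) of a forward SDE with delay of the same type as (9).

```python
import numpy as np

# Minimal Euler-Maruyama sketch of a forward SDE with delay:
#   d lam(t) = ( a*lam(t) + b*lam(t-delta)*1_{[delta,T]}(t) ) dt
#              + c*lam(t) dB(t),
# with lam(0) = lam0 given, mimicking the structure of (9); a, b, c
# and lam0 are arbitrary illustrative constants.
rng = np.random.default_rng(1)
T, delta, n = 1.0, 0.2, 500
dt = T / n
a, b, c, lam0 = -0.5, 0.3, 0.1, 1.0

lam = np.empty(n + 1)
lam[0] = lam0
shift = int(round(delta / dt))   # number of grid points in one delay
for i in range(n):
    t = i * dt
    # chi_{[delta,T]}(t): the delayed term only enters for t >= delta.
    delayed = lam[i - shift] if t >= delta else 0.0
    drift = a * lam[i] + b * delayed
    lam[i + 1] = lam[i] + drift * dt + c * lam[i] * np.sqrt(dt) * rng.standard_normal()

print(lam[-1])  # one simulated terminal value lam(T)
```

The point is only that the delayed value \(\lambda (t-\delta )\) is already known when the scheme reaches time t, so (9) can be integrated forward, whereas (1)–(3) must be handled backwards, as in Sect. 4.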

Theorem 1

(Sufficient maximum principle) Let \(\hat{u}\in \mathscr {A}\) with corresponding solution \(\hat{X}(t),\hat{Y}(t), \hat{A}(t), \hat{Z}(t), \hat{K}(t,\cdot ), \hat{p}(t),\hat{q}(t),\hat{r}(t),\hat{\lambda }(t)\) of (1)–(3), (8) and (9). Assume the following:

  • $$\begin{aligned} \hat{\lambda }(T) \ge 0 \end{aligned}$$
    (10)
  • For all t, the functions

    $$\begin{aligned}&x \rightarrow h(x),x \rightarrow \varphi (x), x \rightarrow \psi (x) \text { and }\nonumber \\&(x,y,a,z,k,u) \rightarrow H(t,x,y,a,z,k,u,\hat{p}(t),\hat{q}(t),\hat{r}(t),\hat{\lambda }(t)) \nonumber \\ \text {are concave} \end{aligned}$$
    (11)
  • For all t the following holds,

$$\begin{aligned}&\text {(The conditional maximum principle)} \nonumber \\&\mathop {ess \;sup}_{v \in \mathbb {U}} E[H(t,\hat{X}(t), \hat{Y}(t), \hat{A}(t),\hat{Z}(t), \hat{K}(t,\cdot ),v, \hat{p}(t), \hat{q}(t), \hat{r}(t,\cdot ), \hat{\lambda }(t)) \mid \mathscr {G}_t] \nonumber \\&\quad = E[H(t,\hat{X}(t), \hat{Y}(t), \hat{A}(t),\hat{Z}(t), \hat{K}(t,\cdot ), \hat{u}(t), \hat{p}(t), \hat{q}(t), \hat{r}(t,\cdot ), \hat{\lambda }(t)) \mid \mathscr {G}_t] \; ; \; t \in [0,T] \end{aligned}$$
    (12)
  • $$\begin{aligned} \left\| \frac{d \nabla _k \hat{H}(t,.)}{d \nu } \right\| < \infty \text { for all } t \in [0,T]. \end{aligned}$$
    (13)

Then \(\hat{u}\) is an optimal control for the problem (5).

Proof

By replacing the terminal time T by an increasing sequence of stopping times \(\tau _n\) converging to T as n goes to infinity, and arguing as in [6] we see that we may assume that all the local martingales appearing in the calculations below are martingales.

Much of the proof is similar to the proof of Theorem 3.1 in [6], but due to the predictive mean-field feature of the BSDE (1)–(3), there are also essential differences. Therefore, for the convenience of the reader, we sketch the whole proof:

Choose \(u \in \mathscr {A}\) and consider

$$\begin{aligned} J(u) - J(\hat{u}) = I_1 + I_2+I_3, \end{aligned}$$
(14)

with

$$\begin{aligned} I_1 := E \left[ \int _0^{T} \{f(t) - \hat{f}(t)\} dt \right] ,\quad I_2 := E[\varphi (X(T)) - \varphi (\hat{X}(T))],\quad I_3 := \psi (Y(0)) - \psi (\hat{Y}(0)), \end{aligned}$$
(15)

where \(\hat{f}(t) = f(t, \hat{X}(t), \hat{Y}(t), \hat{A}(t), \hat{u}(t))\) etc., and \(\hat{Y}(t) = Y^{\hat{u}}(t)\) is the solution of (1)–(3) when \(u = \hat{u}\), and \(\hat{A}(t) = E[ \hat{Y}(t+\delta ) \mid \mathscr {F}_t]\).

By the definition of H we have

$$\begin{aligned} I_1&= E \left[ \int _0^{T} \{H(t) - \hat{H}(t) -\hat{p}(t)\tilde{b}(t) - \hat{q}(t) \tilde{\sigma }(t)\right. \nonumber \\&\qquad \quad \left. - \int _\mathbb {R}\hat{r}(t,\zeta ) \tilde{\gamma }(t,\zeta ) \nu (d\zeta )- \hat{\lambda }(t) \tilde{g}(t) \} dt \right] , \end{aligned}$$
(16)

where we from now on use the abbreviated notation

$$\begin{aligned} H(t)&= H(t, X(t),Y(t), A(t), Z(t),K(t,\cdot ),u(t), \hat{p}(t),\hat{q}(t),\hat{r}(t,\cdot ),\hat{\lambda }(t)) \\ \hat{H}(t)&= H(t, \hat{X}(t),\hat{Y}(t), \hat{A}(t),\hat{Z}(t),\hat{K}(t,\cdot ), \hat{u}(t), \hat{p}(t),\hat{q}(t),\hat{r}(t,\cdot ),\hat{\lambda }(t)) \end{aligned}$$

and we put

$$\tilde{b}(t) := b(t) - \hat{b}(t),$$

and similarly with \(\tilde{X}(t):=X(t)-\hat{X}(t), \tilde{Y}(t):=Y(t)-\hat{Y}(t),\tilde{A}(t):=A(t)-\hat{A}(t), \) etc.

By concavity of \(\varphi \), (9) and the Itô formula,

$$\begin{aligned} I_2&\le E[ \varphi '(\hat{X}(T))\tilde{X}(T)] \nonumber \\&= E [ \hat{p}(T) \tilde{X}(T)] - E [ \hat{\lambda }(T) h'(\hat{X}(T))\tilde{X}(T)] \nonumber \\&= \left( E \left[ \int _0^{T} \hat{p}(t^-) d\tilde{X}(t) + \int _0^{T} \tilde{X}(t^-) d \hat{p}(t) \right. \right. \nonumber + \int _0^{T} \hat{q}(t) \tilde{\sigma }(t)dt \nonumber \\&\left. \left. \qquad \qquad + \int _0^{T} \int _\mathbb {R}\hat{r}(t,\zeta )\tilde{\gamma }(t,\zeta ) \nu (d \zeta )dt \right] \right) - E [ \hat{\lambda }(T) h'(\hat{X}(T))\tilde{X}(T)] \nonumber \\&= E \left[ \int _0^{T} \hat{p}(t) \tilde{b}(t)dt + \int _0^{T} \tilde{X}(t) \left( - \frac{\partial \hat{H}}{\partial x}(t)\right) dt \right. \nonumber \\&\left. \qquad \qquad + \int _0^{T} \hat{q}(t)\tilde{\sigma }(t)dt + \int _0^{T}\int _\mathbb {R}\hat{r}(t,\zeta ) \tilde{\gamma }(t,\zeta ) \nu (d\zeta ) dt \right] \nonumber \\&\quad - E [ \hat{\lambda }(T) h'(\hat{X}(T))\tilde{X}(T)]. \end{aligned}$$
(17)

By concavity of \(\psi \) and h, (10) and the Itô formula we have

$$\begin{aligned} I_3&\le E \left[ \psi '(Y(0)) \tilde{Y}(0)\right] = E[\hat{\lambda }(0) \tilde{Y}(0)] \nonumber \\&= E[\hat{\lambda }(T) \tilde{Y}(T)] - E \left[ \int _0^T \hat{\lambda }(t) d\tilde{Y}(t) + \int _0^T \tilde{Y}(t) d \hat{\lambda }(t)+\int _0^T d[\tilde{Y},\hat{\lambda }](t) \right] \nonumber \\&= E[\hat{\lambda }(T) (h(X(T)) - h(\hat{X}(T)))] \nonumber \\&\quad - E \left[ \int _0^T \hat{\lambda }(t) d\tilde{Y}(t) + \int _0^T \tilde{Y}(t) d \hat{\lambda }(t)+\int _0^T d[\tilde{Y},\hat{\lambda }](t) \right] \nonumber \\&\le E[\hat{\lambda }(T) h'(\hat{X}(T))\tilde{X}(T)] \nonumber \\&\quad + E \left[ \int _0^T \hat{\lambda }(t) \tilde{g}(t) dt + \int _0^T \tilde{Y}(t) \left[ - \frac{\partial \hat{H}}{\partial y}(t) - \frac{\partial \hat{H}}{\partial a} (t-\delta ) \chi _{[\delta ,T]}(t)\right] dt \right. \nonumber \\&\qquad \qquad + \int _0^T \frac{\partial \hat{H}}{\partial z}(t) \tilde{Z}(t)dt \left. + \int _0^T \int _\mathbb {R}\frac{d \nabla _k \hat{H}}{d \nu }(t,\zeta ) \tilde{K}(t,\zeta ) \nu (d\zeta )dt \right] \end{aligned}$$
(18)

Adding (16), (17) and (18) we get, by (9),

$$\begin{aligned} J(u)&- J(\hat{u}) = I_1 +I_2+I_3 \nonumber \\&\le E \left[ \int _0^T\left\{ H(t) - \hat{H}(t) - \frac{\partial \hat{H}}{\partial x} \tilde{X}(t)- \frac{\partial \hat{H}}{\partial y} \tilde{Y}(t) \nonumber \right. \right. \\&\qquad \quad - \frac{\partial \hat{H}}{\partial a}(t-\delta ) \chi _{[\delta ,T]}(t) \tilde{Y}(t) - \frac{\partial \hat{H}}{\partial z}(t) \tilde{Z}(t) \left. - \langle \nabla _k \hat{H}(t,\cdot ), \tilde{K}(t,\cdot )\rangle \left. \right\} dt \right] . \end{aligned}$$
(19)

Note that, since \(Y(s) = \hat{Y}(s) = L\) for \(s \in (T,T+\delta ]\) by (2), we get

$$\begin{aligned} E&\left[ \int _0^T \frac{\partial \hat{H}}{\partial a} (t- \delta ) \tilde{Y}(t) \chi _{[\delta ,T]}(t) dt \right] = E \left[ \int _0^{T-\delta } \frac{\partial \hat{H}}{\partial a}(s) \tilde{Y}(s + \delta )ds \right] \nonumber \\&= E \left[ \int _0^{T - \delta } E \left[ \frac{\partial \hat{H}}{\partial a}(s) \tilde{Y}(s + \delta ) \mid \mathscr {F}_s\right] ds \right] \nonumber \\&= E \left[ \int _0^{T-\delta } \frac{\partial \hat{H}}{\partial a}(s) E \left[ \tilde{Y}(s + \delta ) \mid \mathscr {F}_s\right] ds \right] = E \left[ \int _0^T \frac{\partial \hat{H}}{\partial a}(s) \tilde{A}(s)ds \right] . \end{aligned}$$
(20)

Substituted into (19) this gives, by concavity of H,

$$\begin{aligned} J(u)&- J(\hat{u}) = I_1 +I_2 +I_3\nonumber \\&\le E \left[ \int _0^T\left\{ H(t) - \hat{H}(t) - \frac{\partial \hat{H}}{\partial x} (X(t) - \hat{X}(t))- \frac{\partial \hat{H}}{\partial y} (Y(t) - \hat{Y}(t)) \nonumber \right. \right. \\&\qquad \quad - \frac{\partial \hat{H}}{\partial a}(t) (A(t) - \hat{A}(t)) - \frac{\partial \hat{H}}{\partial z}(t) (Z(t) - \hat{Z}(t)) \nonumber \\&\qquad \quad \left. - \langle \nabla _k \hat{H}(t,\cdot ), K(t,\cdot ) - \hat{K}(t,\cdot )\rangle \left. \right\} dt \right] \nonumber \\&\le E \left[ \int _0^T \frac{\partial \hat{H}}{\partial u} (t) (u(t) - \hat{u}(t)) dt\right] \nonumber \\&= E \left[ \int _0^T E\left[ \frac{\partial \hat{H}}{\partial u} (t)\Big |\mathscr {G}_t\right] (u(t) - \hat{u}(t)) dt \right] \le 0, \end{aligned}$$
(21)

since \(v = \hat{u}(t)\) maximizes \(v \mapsto E[H(t,\hat{X}(t), \hat{Y}(t), \hat{A}(t),\hat{Z}(t), \hat{K}(t,\cdot ),v, \hat{p}(t), \hat{q}(t), \hat{r}(t,\cdot ), \hat{\lambda }(t)) \mid \mathscr {G}_t]\) by (12). \(\square \)

3.2 A Necessary Maximum Principle

We proceed to prove a partial converse of Theorem 1, in the sense that we give necessary conditions for a control \(\hat{u}\) to be optimal. In this case we can only conclude that \(\hat{u}(t)\) is a critical point for the Hamiltonian, not necessarily a maximum point. On the other hand, we do not need any concavity assumptions, but instead we need some properties of the set \(\mathscr {A}\) of admissible controls, as described below.

Theorem 2

(Necessary maximum principle) Suppose \(\hat{u}\in \mathscr {A}\) with associated solutions \(\hat{X}, \hat{Y}, \hat{A}, \hat{Z}, \hat{K}, \hat{p},\hat{q},\hat{r},\hat{\lambda }\) of (1)–(3), (8) and (9). Suppose that for all processes \(\beta (t)\) of the form

$$\begin{aligned} \beta (t) := \chi _{[t_0, T]}(t) \alpha , \end{aligned}$$
(22)

where \(t_0 \in [0,T)\) and \( \alpha = \alpha (\omega )\) is a bounded \(\mathscr {G}_{t_0}\)-measurable random variable, there exists \(\varepsilon > 0\) such that the process

$$ \hat{u}(t) + r\beta (t) \in \mathscr {A}\text { for all } r \in [- \varepsilon , \varepsilon ].$$

We assume that the derivative processes defined by

$$\begin{aligned} x(t) = x^\beta (t) = \frac{d}{dr} X^{\hat{u}+ r \beta }(t) \mid _{r=0}, \end{aligned}$$
(23)
$$\begin{aligned} y(t) = y^\beta (t) = \frac{d}{dr} Y^{\hat{u}+ r \beta }(t) \mid _{r=0}, \end{aligned}$$
(24)
$$\begin{aligned} a(t) = a^\beta (t) = \frac{d}{dr} A^{\hat{u}+ r \beta }(t) \mid _{r=0}, \end{aligned}$$
(25)
$$\begin{aligned} z(t) = z^\beta (t) = \frac{d}{dr} Z^{\hat{u}+ r \beta }(t) \mid _{r=0}, \end{aligned}$$
(26)
$$\begin{aligned} k(t) = k^\beta (t) = \frac{d}{dr} K^{\hat{u}+ r \beta }(t) \mid _{r=0}, \end{aligned}$$
(27)

exist and belong to \(L^2(m \times P)\), \(L^2(m \times P)\), \(L^2(m \times P)\), \(L^2(m \times P)\), and \(L^2(m \times P \times \nu )\), respectively.

Moreover, we assume that x(t) satisfies the equation

$$\begin{aligned} {\left\{ \begin{array}{ll} dx(t) = \displaystyle \left\{ \frac{\partial b}{\partial x}(t) x(t) + \frac{\partial b}{\partial y}(t)y(t)+ \frac{\partial b}{\partial a}(t)a(t) + \frac{\partial b}{\partial z}(t)z(t) + \langle \nabla _k b, k(t,\cdot )\rangle \right. \\ \displaystyle \qquad \qquad \qquad + \left. \frac{\partial b}{\partial u}(t) \beta (t) \right\} dt \\ \quad \qquad \qquad \displaystyle + \left\{ \frac{\partial \sigma }{\partial x}(t) x(t) + \frac{\partial \sigma }{\partial y}(t) y(t) + \frac{\partial \sigma }{\partial a}(t) a(t)+ \frac{\partial \sigma }{\partial z}(t) z(t) + \langle \nabla _k \sigma , k(t,\cdot )\rangle \right. \\ \displaystyle \qquad \qquad \qquad \quad \left. + \frac{\partial \sigma }{\partial u}(t) \beta (t)\right\} dB(t) \\ \qquad \qquad \quad \displaystyle + \int _\mathbb {R}\left\{ \frac{\partial \gamma }{\partial x}(t,\zeta ) x(t) + \frac{\partial \gamma }{\partial y}(t,\zeta ) y(t)+ \frac{\partial \gamma }{\partial a}(t,\zeta ) a(t) + \frac{\partial \gamma }{\partial z}(t,\zeta ) z(t) \right. \\ \qquad \qquad \qquad \qquad \quad \left. \displaystyle + \langle \nabla _k \gamma (t,\zeta ), k(t,\cdot )\rangle + \frac{\partial \gamma }{\partial u}(t,\zeta ) \beta (t)\right\} \tilde{N}(dt,d\zeta ) \; ; \; t \in [0,T] \\ x(0) = 0 \end{array}\right. } \end{aligned}$$
(28)

and that y(t) satisfies the equation

$$\begin{aligned} {\left\{ \begin{array}{ll} dy(t) &{}= -\left\{ \frac{\partial g}{\partial x}(t) x(t)+\frac{\partial g}{\partial y}(t) y(t) + \frac{\partial g}{\partial a}(t)a(t) + \frac{\partial g}{\partial z}(t)z(t) \right. \\ &{}\quad \qquad \left. + \langle \nabla _k g(t), k(t,\cdot )\rangle + \frac{\partial g}{\partial u}(t) \beta (t) \right\} dt \\ &{}\quad + z(t) dB(t) + \int _\mathbb {R}k(t,\zeta ) \tilde{N}(dt, d\zeta )\; ; \; 0 \le t < T \\ y(T)&{}=h'(X(T))x(T)\\ y(t)&{}= 0 \; ; \; T < t \le T+\delta , \end{array}\right. } \end{aligned}$$
(29)

where we have used the abbreviated notation

$$\begin{aligned} \frac{\partial g}{\partial x}(t) = \frac{\partial }{\partial x} g(t,x,y,a,z,k,u)_{x=X(t),y =Y(t),a=A(t),z=Z(t),k=K(t),u=u(t)} \text { etc.} \end{aligned}$$

Then the following, (i) and (ii), are equivalent:

  1. (i)

    \(\displaystyle \frac{d}{dr} J(\hat{u}+ r \beta )_{r=0} = 0\) for all \(\beta \) of the form (22)

  2. (ii)

    \(\displaystyle \frac{d}{du} E[H(t, \hat{X}(t), \hat{Y}(t), \hat{A}(t),\hat{Z}(t),\hat{K}(t,\cdot ),u, \hat{p}(t), \hat{q}(t), \hat{r}(t,\cdot ), \hat{\lambda }(t))_{u = \hat{u}(t)}|\mathscr {G}_t] = 0,\)

    where \((\hat{X}, \hat{Y}, \hat{A}, \hat{Z},\hat{K},\hat{\lambda })\) is the solution of (1)–(3), (8) and (9) corresponding to \(u=\hat{u}\).

Proof

As in Theorem 1, by replacing the terminal time T by an increasing sequence of stopping times \(\tau _n\) converging to T as n goes to infinity, we obtain as in [6] that we may assume that all the local martingales appearing in the calculations below are martingales. The proof has many similarities with the proof of Theorem 3.2 in [6], but since there are some essential differences due to the predictive mean-field term, we sketch the whole proof. For simplicity of notation we drop the hats in the sequel, i.e. we write u instead of \(\hat{u}\) etc.

(i) \(\Rightarrow \) (ii): We can write \(\displaystyle \frac{d}{dr} J(u + r \beta ) \mid _{r=0} = I_1 + I_2 +I_3\), where

$$\begin{aligned} I_1&= \frac{d}{dr} E \left[ \int _0^T f(t,X^{u+r \beta }(t), Y^{u+r \beta }(t), A^{u+r \beta }(t), Z^{u+r \beta }(t), K^{u+r \beta }(t), u(t) + r \beta (t))dt\right] _{r=0} \\ I_2&= \frac{d}{dr} E[ \varphi (X^{u+r \beta }(T))]_{r=0}\\ I_3&= \frac{d}{dr} [ \psi (Y^{u+r \beta }(0))]_{r=0}. \end{aligned}$$

By our assumptions on f and \(\psi \) we have

$$\begin{aligned} I_1&= E\left[ \int _0^T \left\{ \frac{\partial f}{\partial x}(t) x(t) +\frac{\partial f}{\partial y}(t) y(t) + \frac{\partial f}{\partial a}(t) a(t) + \frac{\partial f}{\partial z}(t) z(t)\right. \right. \nonumber \\&\qquad \qquad \quad \left. \left. + \langle \nabla _k f(t,\cdot ), k(t,\cdot )\rangle + \frac{\partial f}{\partial u}(t) \beta (t)\right\} dt\right] \end{aligned}$$
(30)
$$\begin{aligned} \nonumber \\ I_2&=E[\varphi '(X(T))x(T)]=E[p(T)x(T)]\end{aligned}$$
(31)
$$\begin{aligned} I_3&= \psi '(Y(0))y(0) = \lambda (0) y(0). \end{aligned}$$
(32)

By the Itô formula and (28)

$$\begin{aligned} I_2&= E[p(T) x(T)] \nonumber = E \left[ \int _0^{T} p(t) dx(t) + \int _0^{T} x(t) dp(t) + \int _0^{T} d[p,x](t) \right] \nonumber \\&= E \left[ \int _0^{T} p(t) \left\{ \frac{\partial b}{\partial x}(t) x(t) + \frac{\partial b}{\partial y}(t) y(t)+ \frac{\partial b}{\partial a}(t) a(t) + \frac{\partial b}{\partial z}(t) z(t) \right. \right. \nonumber \\&\left. \qquad \quad + \langle \nabla _k b(t),k(t,\cdot )\rangle + \frac{\partial b}{\partial u}(t) \beta (t) \right\} dt + \int _0^{T}x(t) \left( - \frac{\partial H}{\partial x}(t)\right) dt \nonumber \\&\qquad \quad \left. + \int _0^{T} q(t) \left\{ \frac{\partial \sigma }{\partial x}(t) x(t) \right. + \frac{\partial \sigma }{\partial y}(t) y(t)+ \frac{\partial \sigma }{\partial a}(t) a(t) + \frac{\partial \sigma }{\partial z}(t) z(t) \right. \nonumber \\&\qquad \quad \left. + \langle \nabla _k \sigma (t), k(t,\cdot )\rangle + \frac{\partial \sigma }{\partial u}(t) \beta (t) \right\} dt\nonumber \\&\qquad \quad + \int _0^{T}\int _\mathbb {R}r(t,\zeta ) \left\{ \frac{\partial \gamma }{\partial x}(t,\zeta )x(t) + \frac{\partial \gamma }{\partial y}(t,\zeta )y(t)+ \frac{\partial \gamma }{\partial a}(t,\zeta )a(t) + \frac{\partial \gamma }{\partial z}(t,\zeta )z(t) \right. \nonumber \\&\qquad \quad \left. \left. + \langle \nabla _k \gamma (t,\zeta ), k(t,\cdot )\rangle + \frac{\partial \gamma }{\partial u}(t,\zeta )\beta (t)\right\} \nu (d\zeta ) dt \right] \nonumber \\&= E \left[ \int _0^{T} x(t) \left\{ \frac{\partial b}{\partial x}(t) p(t) + \frac{\partial \sigma }{\partial x}(t) q(t) + \int _\mathbb {R}\frac{\partial \gamma }{\partial x}(t,\zeta ) r(t,\zeta ) \nu (d\zeta ) - \frac{\partial H}{\partial x}(t) \right\} dt\right. \nonumber \\&\qquad \quad + \int _0^{T} y(t) \left\{ \frac{\partial b}{\partial y}(t) p(t) + \frac{\partial \sigma }{\partial y}(t) q(t) + \int _\mathbb {R}\frac{\partial \gamma }{\partial y}(t,\zeta ) r(t,\zeta ) \nu (d\zeta ) \right\} dt\nonumber \\&\qquad \quad + \int _0^{T} a(t) \left\{ \frac{\partial b}{\partial a}(t) p(t) + \frac{\partial \sigma }{\partial a}(t) q(t) + \int _\mathbb {R}\frac{\partial \gamma }{\partial a}(t,\zeta ) r(t,\zeta ) \nu (d\zeta ) \right\} dt\nonumber \\&\qquad \quad + \int _0^{T} z(t) \left\{ \frac{\partial b}{\partial z}(t) p(t) + \frac{\partial \sigma }{\partial z}(t) q(t) + \int _\mathbb {R}\frac{\partial \gamma }{\partial z}(t,\zeta ) r(t,\zeta ) \nu (d\zeta ) \right\} dt\nonumber \\&\qquad \quad + \int _0^{T} \langle k(t,\cdot ), \nabla _kb(t)p(t) + \nabla _k \sigma (t) q(t) \nonumber \\&\qquad \quad \left. + \int _\mathbb {R}\nabla _k \gamma (t,\zeta ) r(t,\zeta ) \nu (d \zeta ) \rangle dt\right] \nonumber \\&= E \left[ \int _0^{T} x(t) \left\{ - \frac{\partial f}{\partial x}(t) - \lambda (t) \frac{\partial g}{\partial x}(t) \right\} \right. dt \nonumber \\&\qquad \quad + \int _0^{T} y(t) \left\{ \frac{\partial H}{\partial y}(t)- \frac{\partial f}{\partial y}(t) - \lambda (t) \frac{\partial g}{\partial y}(t) \right\} dt \nonumber \\&\qquad \quad + \int _0^{T} a(t) \left\{ \frac{\partial H}{\partial a}(t)- \frac{\partial f}{\partial a}(t) - \lambda (t) \frac{\partial g}{\partial a}(t) \right\} dt \nonumber \\&\qquad \quad + \int _0^{T} z(t) \left\{ \frac{\partial H}{\partial z}(t)- \frac{\partial f}{\partial z}(t) - \lambda (t) \frac{\partial g}{\partial z}(t) \right\} dt\nonumber \\&\qquad \quad + \int _0^{T} \int _\mathbb {R}k(t,\zeta ) \{ \nabla _kH(t) - \nabla _k f(t) - \lambda (t) \nabla _k g(t)\} \nu (d\zeta ) dt \nonumber \\&\qquad \quad \left. + \int _0^{T} \beta (t) \left\{ \frac{\partial H}{\partial u}(t)- \frac{\partial f}{\partial u}(t) - \lambda (t) \frac{\partial g}{\partial u}(t) \right\} dt \right] \nonumber \\&= - I_1 - E \left[ \int _0^T \lambda (t) \left\{ \frac{\partial g}{\partial x}(t) x(t) + \frac{\partial g}{\partial y} (t) y(t) + \frac{\partial g}{\partial z} (t) z(t) \right. \right. \nonumber \\&\quad \left. \left. + \langle \nabla _kg(t), k(t,\cdot )\rangle + \frac{\partial g}{\partial u}(t) \beta (t) \right\} dt \right] \nonumber \\&\quad + E \left[ \int _0^T \left\{ \frac{\partial H}{\partial y} (t) y(t) + \frac{\partial H}{\partial z} (t) z(t) + \langle \nabla _k H(t), k(t,\cdot )\rangle + \frac{\partial H}{\partial u} (t) \beta (t)\right\} dt\right] \end{aligned}$$
(33)

By the Itô formula and (29),

$$\begin{aligned} I_3&= \lambda (0) y(0) = E \left[ \lambda (T) y(T) - \left( \int _0^{T} \lambda (t) dy(t) + \int _0^{T} y(t) d \lambda (t) + \int _0^{T} d[\lambda , y](t)\right) \right] \nonumber \\&= E[ \lambda (T) y(T)] \nonumber \\&\quad - \left( E \left[ \int _0^{T} \lambda (t) \left\{ - \frac{\partial g}{\partial x}(t) x(t) - \frac{\partial g}{\partial y}(t) y(t) - \frac{\partial g}{\partial a}(t) a(t) - \frac{\partial g}{\partial z}(t) z(t) \right. \right. \right. \nonumber \\&\left. \qquad \qquad \qquad \qquad \qquad \quad - \langle \nabla _k g(t), k(t,\cdot )\rangle - \frac{\partial g}{\partial u}(t) \beta (t) \right\} dt \nonumber \\&\qquad \qquad \quad + \int _0^{T} y(t) \frac{\partial H}{\partial y}(t) dt + \int _0^{T} y(t) \frac{\partial H}{\partial a}(t-\delta ) \chi _{[\delta ,T]}(t) dt + \int _0^{T} z(t) \frac{\partial H}{\partial z}(t) dt \nonumber \\&\left. \left. \qquad \qquad \quad + \int _0^{T} \int _\mathbb {R}k(t,\zeta ) \nabla _k H(t,\zeta ) \nu (d \zeta ) dt \right] \right) . \end{aligned}$$
(34)

Adding (30), (33) and (34) and using that

$$\begin{aligned}&E\left[ \int _0^T y(t)\frac{\partial H}{\partial a}(t-\delta ) \chi _{[\delta ,T]}(t) dt \right] =E\left[ \int _0^{T-\delta } y(s+\delta )\frac{\partial H}{\partial a}(s) ds\right] \nonumber \\&=E\left[ \int _0^{T-\delta } \frac{\partial H}{\partial a}(s)E[y(s+\delta )|\mathscr {F}_s] ds\right] =E\left[ \int _0^T \frac{\partial H}{\partial a}(s) a(s) ds\right] , \end{aligned}$$
(35)

we get

$$ \frac{d}{dr} J(u + r \beta ) \mid _{r=0} = I_1 + I_2 + I_3 = E \left[ \int _0^T \frac{\partial H}{\partial u}(t) \beta (t)dt \right] .$$

We conclude that

$$\frac{d}{dr} J(\hat{u}+ r \beta ) \mid _{r=0} = 0 $$

if and only if

$$E \left[ \int _0^T \frac{\partial \hat{H}}{\partial u}(t) \beta (t) dt \right] = 0 \; \; \text { for all bounded } \beta \in \mathscr {A}_{\mathbb {G}} \text { of the form } (22).$$

Since this holds for all such \(\beta \), we obtain that if (i) holds, then

$$\begin{aligned} \int _{t_0}^T E \left[ \frac{\partial \hat{H}}{\partial u}(t) \mid \mathscr {G}_{t_0} \right] dt = 0 \text { for all } t_0 \in [0,T). \end{aligned}$$
(36)

Differentiating with respect to \(t_0\) and using continuity of \(\displaystyle \frac{\partial \hat{H}}{\partial u}(t)\), we conclude that (ii) holds.

(ii) \(\Rightarrow \) (i): This is proved by reversing the above argument. We omit the details. \(\square \)

4 Existence and Uniqueness of Predictive Mean-Field Equations

In this section we study the existence and uniqueness of predictive mean-field BSDEs in the unknowns \(Y(t),Z(t),K(t,\zeta )\) of the form

$$\begin{aligned} {\left\{ \begin{array}{ll} dY(t) &{} = -g(t,Y(t), A(t), Z(t),K(t,\cdot ), \omega )dt + Z(t) dB(t) \\ &{}\quad + \int _\mathbb {R}K(t, \zeta ) \tilde{N}(dt, d\zeta ) \; ; \; t \in [0,T) \\ Y(t) &{} = L \; ; \; t \in [T, T+\delta ] \; ; \; \delta > 0 \text { fixed}, \end{array}\right. } \end{aligned}$$
(37)

where \(L \in L^2(P)\) is a given \(\mathscr {F}_T\)-measurable random variable, and the process A(t) as before is defined by

$$\begin{aligned} A(t) = E[Y(t + \delta ) \mid \mathscr {F}_t ] \; ; \; t \in [0, T]. \end{aligned}$$
(38)

To this end, we can use the same argument that was used to handle a similar (but different) time-advanced BSDE in [7]. For completeness we give the details:

Theorem 3

Suppose the following holds

$$\begin{aligned}&E[\int _0^T g^2(t,0,0,0,0)dt] < \infty \end{aligned}$$
(39)
$$\begin{aligned}&\text {There exists a constant } C \text { such that }\nonumber \\&|g(t,y_1,a_1,z_1,k_1) -g(t,y_2,a_2,z_2,k_2)|\le C(|y_1-y_2| + |z_1-z_2| + \nonumber \\&\qquad \quad (\int _{\mathbb {R}}|k_1(\zeta ) - k_2(\zeta )|^2\nu (d\zeta ))^{\frac{1}{2}}) \end{aligned}$$
(40)

for all \(t \in [0,T],\) a.s. Then there exists a unique solution triple \((Y(t),Z(t),K(t,\zeta ))\) of (37) such that the following holds:

$$\begin{aligned} {\left\{ \begin{array}{ll} Y \text { is càdlàg and } E[\sup _{t \in [0,T]} Y^2(t)] < \infty , \\ Z,K \text { are predictable and } E[\int _{0}^T\{ Z^2(t) + \int _{\mathbb {R}}K^2(t,\zeta )\nu (d\zeta )\}dt] < \infty . \end{array}\right. } \end{aligned}$$

Proof

We argue backwards, starting with the interval \([T-\delta ,T]\):

Step 1. In this interval we have \(A(t)=E[L|\mathscr {F}_t]\) and hence we know from the theory of classical BSDEs (see e.g. [8, 9] and the references therein), that there exists a unique solution triple \((Y(t),Z(t),K(t,\zeta ))\) such that the following holds:

$$\begin{aligned} {\left\{ \begin{array}{ll} Y \text { is càdlàg and } E[\sup _{t \in [T-\delta ,T]} Y^2(t)] < \infty , \\ Z,K \text { are predictable and } E[\int _{T-\delta }^T\{ Z^2(t) + \int _{\mathbb {R}}K^2(t,\zeta )\nu (d\zeta )\}dt] < \infty . \end{array}\right. } \end{aligned}$$

Step 2. Next, we continue with the interval \([T-2\delta ,T-\delta ]\). For t in this interval, the value of \(Y(t+\delta )\) is known from the previous step, and hence \(A(t)=E[Y(t+\delta )|\mathscr {F}_t]\) is known. Moreover, by Step 1 the terminal value for this interval, \(Y(T-\delta )\), is known and belongs to \(L^2(P)\). Hence we can again refer to the theory of classical BSDEs and obtain a unique solution on this interval.

Step n. We continue this iteration until we have reached the interval \([0, T-n\delta ]\), where n is a natural number such that

$$T-(n+1)\delta \le 0 < T-n\delta .$$

Combining the solutions from each of the subintervals, we obtain a unique solution on the whole interval \([0,T]\). \(\square \)
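The interval-by-interval recursion in this proof can be illustrated numerically. In the degenerate noise-free case \(Z = K = 0\) with a deterministic driver, (37)–(38) reduce to the time-advanced ODE \(Y'(t) = -g(t, Y(t), A(t))\) with \(A(t) = Y(t+\delta )\), and the recursion becomes the classical method of steps. The following minimal Python sketch uses the illustrative linear driver \(g(t,y,a)=a\) and arbitrary parameter values (none of them from the paper); for this driver one computes by hand that \(Y(0) = 2.125\,L\).

```python
import numpy as np

# Noise-free toy version of (37)-(38): with Z = K = 0 and a deterministic
# driver, the predictive BSDE becomes the time-advanced ODE
#   Y'(t) = -g(t, Y(t), A(t)),  A(t) = Y(t + delta),  Y(t) = L on [T, T + delta].
# Illustrative parameters (not from the paper):
T, delta, L = 1.0, 0.5, 2.0
h = 5e-4                       # Euler step, chosen so that delta / h is an integer
nd = round(delta / h)          # grid offset corresponding to the time advance delta
n = round((T + delta) / h)     # grid covers [0, T + delta]

g = lambda t, y, a: a          # illustrative linear driver

Y = np.empty(n + 1)
Y[round(T / h):] = L           # terminal condition Y(t) = L on [T, T + delta]

# Explicit Euler, marching backward in time: on each interval
# [T-(k+1)delta, T-k delta] the advanced value Y(t + delta) is already
# known from the previous sweep (no noise, so no conditional expectation).
for i in range(round(T / h) - 1, -1, -1):
    t_next = (i + 1) * h
    A_next = Y[i + 1 + nd]     # A(t) = Y(t + delta)
    Y[i] = Y[i + 1] + h * g(t_next, Y[i + 1], A_next)

Y0 = Y[0]                      # exact value for this driver: 2.125 * L = 4.25
```

The scheme reproduces the structure of Steps 1 to n above: each sweep only uses values of Y already computed on the interval one \(\delta \) to the right.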

5 Applications

In this section we illustrate the results of the previous sections by looking at two examples.

5.1 Optimal Portfolio in an Insider Influenced Market

In the seminal papers by Kyle [4] and Back [2] it is proved that in a financial market consisting of

  • noise traders (where noise is modeled by Brownian motion),

  • an insider who knows the value L of the price of the risky asset at the terminal time \(t=T\) and

  • a market maker who at any time t clears the market and sets the market price,

the corresponding equilibrium price process (resulting from the insider’s portfolio which maximizes her expected profit) will be a Brownian bridge terminating at the value L at time \(t =T\). In view of this we see that a predictive mean-field equation can be a natural model of the risky asset price in an insider influenced market.

Accordingly, suppose we have a market with the following two investment possibilities:

  • A risk free asset, with unit price \(S_0(t):=1\) for all t

  • A risky asset with unit price \(S(t):=Y(t)\) at time t, given by the predictive mean-field equation

    $$\begin{aligned} {\left\{ \begin{array}{ll} dY(t) &{} = -A(t)\mu (t)dt + Z(t) dB(t); \; t \in [0,T) \\ Y(t) &{} = L(\omega ); \quad t \in [T,T+\delta ], \end{array}\right. } \end{aligned}$$
    (41)

    where \(\mu (t)=\mu (t,\omega )\) is a given bounded adapted process and L is a given bounded \(\mathscr {F}_T\)-measurable random variable, being the terminal state of the process Y at time T.

Let u(t) be a portfolio, representing the number of risky assets held at time t. We assume that \(\mathbb {G} = \mathbb {F}\). If we assume that the portfolio is self-financing, the corresponding wealth process \(X(t)=X^{u}(t)\) is given by

$$\begin{aligned} {\left\{ \begin{array}{ll} dX(t)= u(t) dY(t) = u(t) A(t)\mu (t) dt + u(t) Z(t) dB(t); \; t \in [0,T)\\ X(0) = x > 0. \end{array}\right. } \end{aligned}$$
(42)

Let \(U: [0,\infty ) \mapsto [-\infty ,\infty )\) be a given utility function, assumed to be increasing, concave and \(C^{1}\) on \((0,\infty )\). We study the following portfolio optimization problem:

Problem 1

Find \(u^* \in \mathscr {A}\) such that

$$\begin{aligned} \sup _{u \in \mathscr {A}} E[U(X^{u}(T))] = E[U(X^{u^*}(T))]. \end{aligned}$$
(43)

This is a problem of the type discussed in the previous sections, with \(f=\psi =N=0, \varphi =U\) and \( h(x,\omega )=L(\omega )\), and we can apply the maximum principles from Sect. 3 to study it.

By (6) the Hamiltonian gets the form

$$\begin{aligned} H(t,x,y,a,z,k,u,p,q,r,\lambda )&= ua\mu (t)p+uzq+a\mu (t)\lambda . \end{aligned}$$
(44)

The associated backward-forward system of equations in the adjoint processes \(p(t),q(t),\lambda (t)\) becomes

  • BSDE in p(t), q(t):

    $$\begin{aligned} {\left\{ \begin{array}{ll} dp(t) &{} = q(t)dB(t) \; ; \; 0 \le t \le T \\ p(T) &{} = U'(X(T)), \end{array}\right. } \end{aligned}$$
    (45)
  • SDE in \(\lambda (t)\):

    $$\begin{aligned} {\left\{ \begin{array}{ll} d\lambda (t) = \mu (t-\delta )[u(t-\delta )p(t-\delta )+\lambda (t-\delta )] \chi _{[\delta ,T]}(t) dt \\ \qquad \qquad + u(t)q(t) dB(t)\; ; \; 0 \le t \le T \\ \lambda (0) = 0. \end{array}\right. } \end{aligned}$$
    (46)

Since the Hamiltonian is linear in u, it can only have a maximum with respect to u if the coefficient of u vanishes, i.e. if

$$\begin{aligned} A(t)\mu (t)p(t)+Z(t)q(t)=0. \end{aligned}$$
(47)

Substituting this into (45) we get

$$\begin{aligned} {\left\{ \begin{array}{ll} dp(t) &{} = -\theta (t) p(t)dB(t) ; 0 \le t \le T \\ p(T) &{} = U'(X(T)), \end{array}\right. } \end{aligned}$$
(48)

where

$$\begin{aligned} \theta (t):= \frac{A(t)\mu (t)}{Z(t)}. \end{aligned}$$
(49)

From this we get

$$\begin{aligned} p(t)=c \exp (-\int _0^t \theta (s)dB(s) -\frac{1}{2}\int _0^t (\theta (s))^2 ds) ; 0 \le t \le T \end{aligned}$$
(50)

where the constant

$$\begin{aligned} c =p(0)= E[U'(X(T))] \end{aligned}$$
(51)

remains to be determined.

In particular, putting \(t=T\) in (50) we get

$$\begin{aligned} U'(X(T))= p(T)=c \exp (-\int _0^T \theta (s)dB(s) -\frac{1}{2}\int _0^T (\theta (s))^2 ds) \end{aligned}$$
(52)

or

$$\begin{aligned} X(T)= (U')^{-1}(c \exp (-\int _0^T\theta (s)dB(s) -\frac{1}{2}\int _0^T (\theta (s))^2 ds)). \end{aligned}$$
(53)

Define

$$\begin{aligned} \Gamma (T)= \exp (\int _0^T \theta (s)dB(s) -\frac{1}{2}\int _0^T (\theta (s))^2 ds). \end{aligned}$$
(54)

Then by the Girsanov theorem the measure Q defined on \(\mathscr {F}_T\) by

$$\begin{aligned} dQ(\omega )=\Gamma (T) dP(\omega ) \end{aligned}$$
(55)

is an equivalent martingale measure for the market (41). Therefore, by (53),

$$\begin{aligned} x= E_Q[X(T)] =E[(U')^{-1}(c \exp (-\int _0^T\theta (s)dB(s) -\frac{1}{2}\int _0^T (\theta (s))^2 ds))\Gamma (T)]. \end{aligned}$$
(56)

This equation determines implicitly the value of the constant c and hence by (53) the optimal terminal wealth \(X(T)=X^{u^*}(T)\). To find the corresponding optimal portfolio \(u^*\) we proceed as follows:
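For a concrete illustration of how (56) pins down c, suppose (as an assumption for this example only) log utility \(U(x) = \ln x\) and a constant, deterministic \(\theta \). Then \((U')^{-1}(y) = 1/y\), the integrand in (56) reduces to \(c^{-1}e^{2\theta B(T)}\), and (56) becomes \(x = e^{2\theta ^2 T}/c\), i.e. \(c = e^{2\theta ^2 T}/x\). The following Monte Carlo sketch recovers this value by bisection on c; all parameter values are illustrative.

```python
import numpy as np

# Determining c from (56), assuming log utility U(x) = ln x (so
# (U')^{-1}(y) = 1/y) and a constant deterministic theta.
# Closed form under these assumptions: c = exp(2 theta^2 T) / x.
theta, T, x = 0.3, 1.0, 1.0                        # illustrative parameters
rng = np.random.default_rng(0)
B_T = rng.normal(0.0, np.sqrt(T), size=400_000)    # B(T) ~ N(0, T), fixed across bisection

def F(c):
    """Monte Carlo estimate of the right-hand side of (56) as a function of c."""
    X_T = 1.0 / (c * np.exp(-theta * B_T - 0.5 * theta**2 * T))  # (U')^{-1}(...), log utility
    Gamma_T = np.exp(theta * B_T - 0.5 * theta**2 * T)           # Girsanov density (54)
    return np.mean(X_T * Gamma_T)

# F is strictly decreasing in c, so solve F(c) = x by bisection.
lo, hi = 1e-3, 1e3
for _ in range(200):
    mid = 0.5 * (lo + hi)
    lo, hi = (mid, hi) if F(mid) > x else (lo, mid)

c_mc = 0.5 * (lo + hi)
c_exact = np.exp(2 * theta**2 * T) / x
```

Fixing the samples `B_T` across bisection iterations makes F deterministic and monotone in c, so the bisection converges; only the Monte Carlo error in estimating the expectation remains.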

Define

$$\begin{aligned} Z_0(t) := u^*(t)Z(t). \end{aligned}$$
(57)

Then \((X^{u^*}(t),Z_0(t))\) is found by solving the linear BSDE

$$\begin{aligned} {\left\{ \begin{array}{ll} dX^{u^*}(t) = \frac{A(t)\mu (t)Z_0(t)}{Z(t)}dt + Z_0(t) dB(t); \; 0 \le t \le T\\ X^{u^*}(T)=(U')^{-1}(c \exp (-\int _0^T\theta (s)dB(s) -\frac{1}{2}\int _0^T (\theta (s))^2 ds)). \end{array}\right. } \end{aligned}$$
(58)

We have proved:

Theorem 4

(Optimal portfolio in an insider influenced market) The optimal portfolio \(u^*\) for the problem (43) is given by

$$\begin{aligned} u^*(t) = \frac{Z_0(t)}{Z(t)}, \end{aligned}$$
(59)

where \(Z_0(t)\) and \(Z(t)\) are the solutions of the BSDEs (58) and (41), respectively, and c and \(\theta \) are given by (56) and (49), respectively.

5.2 Predictive Recursive Utility Maximization

Consider a cash flow \(X(t)=X^{c}(t)\) given by

$$\begin{aligned} {\left\{ \begin{array}{ll} dX(t)= X(t)[\mu (t) dt + \sigma (t)dB(t)\\ \qquad \qquad \quad +\int _{\mathbb {R}}\gamma (t,\zeta )\tilde{N}(dt,d\zeta )]-c(t) X(t) dt; \; t \in [0,T)\\ X(0) = x > 0. \end{array}\right. } \end{aligned}$$
(60)

Here \(\mu (t),\sigma (t),\gamma (t,\zeta )\) are given bounded adapted processes, while \(u(t):=c(t)\) is our control, interpreted as our relative consumption rate from the cash flow. We say that c is admissible if c is \(\mathbb {F}\)-adapted, \(c(t) > 0\) and \(X^c(t) > 0\) for all \(t \in [0,T)\). We put \(\mathbb {G} = \mathbb {F}\).

Let \(Y(t)=Y^c(t),Z(t)=Z^c(t),K(t,\zeta )=K^c(t,\zeta )\) be the solution of the predictive mean-field BSDE defined by

$$\begin{aligned} {\left\{ \begin{array}{ll} dY(t)=- \{\alpha (t)A(t) + \ln (c(t)X(t))\} dt + Z(t) dB(t) \\ \qquad \qquad \quad +\int _{\mathbb {R}}K(t,\zeta )\tilde{N}(dt,d\zeta ); \; t \in [0,T)\\ Y(T) = 0, \end{array}\right. } \end{aligned}$$
(61)

where \(\alpha (t) > 0\) is a given bounded \(\mathbb {F}\)-adapted process. Then, inspired by the classical definition of recursive utility in [3], we define \(Y^c(0)\) to be the predictive recursive utility of the relative consumption rate c.

We now study the following predictive recursive utility maximization problem:

Problem 2

Find \(c^* \in \mathscr {A}\) such that

$$\begin{aligned} \sup _{c \in \mathscr {A}} Y^c(0) = Y^{c^*}(0). \end{aligned}$$
(62)

We apply the maximum principle to study this problem. In this case we have \(f=\varphi =h=0, \psi (x)=x\), and the Hamiltonian becomes

$$\begin{aligned} H(t,x,y,a,z,k,u,p,q,r,\lambda )&= x [(\mu (t) -c)p+\sigma (t) q+\int _{\mathbb {R}}\gamma (t,\zeta )r(\zeta )\nu (d\zeta )] \nonumber \\&\qquad +[a\alpha (t)+\ln c+\ln x]\lambda . \end{aligned}$$
(63)

The associated backward-forward system of equations in the adjoint processes \(p(t),q(t),\lambda (t)\) becomes

  • BSDE in p(t), q(t):

$$\begin{aligned} {\left\{ \begin{array}{ll} dp(t) &{} = -[(\mu (t)-c(t))p(t)+\sigma (t)q(t) +\int _{\mathbb {R}} \gamma (t,\zeta )r(t,\zeta )\nu (d\zeta )+\frac{\lambda (t)}{X(t)}]dt\\ &{} \quad + q(t)dB(t) +\int _{\mathbb {R}}r(t,\zeta ) \tilde{N}(dt,d\zeta ) \; ; \; 0 \le t \le T \\ p(T) &{} = 0, \end{array}\right. } \end{aligned}$$
    (64)
  • SDE in \(\lambda (t)\):

$$\begin{aligned} {\left\{ \begin{array}{ll} d\lambda (t) = \alpha (t-\delta )\lambda (t-\delta ) \chi _{[\delta ,T]}(t) dt\; ; \; 0 \le t \le T \\ \lambda (0) = 1. \end{array}\right. } \end{aligned}$$
    (65)

The delay SDE (65) does not contain any unknown parameters, and it is easily seen that it has a unique continuous solution satisfying \(\lambda (t) \ge 1\) (with \(\lambda (t) = 1\) for \(t \le \delta \)), which we may consider known.
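Since (65) involves only the delayed value \(\lambda (t-\delta )\), it can be computed by the method of steps: on \([0,\delta ]\) the indicator vanishes and \(\lambda \equiv 1\), and on each subsequent interval the delayed value is already known. A minimal Python sketch for a constant \(\alpha \) (an illustrative choice, under which (65) is a deterministic delay ODE):

```python
import numpy as np

# Method of steps for the delay equation (65) with constant alpha:
#   lambda'(t) = alpha * lambda(t - delta) * chi_{[delta, T]}(t),  lambda(0) = 1.
alpha, delta, T = 0.5, 0.5, 1.5        # illustrative parameters
h = 1e-4                               # Euler step, chosen so that delta / h is an integer
nd = round(delta / h)                  # grid offset corresponding to the delay
n = round(T / h)

lam = np.empty(n + 1)
lam[0] = 1.0
for i in range(n):
    # chi_{[delta,T]}(t_i): the drift switches on only once t_i >= delta,
    # and then uses the already-computed delayed value lambda(t_i - delta).
    drift = alpha * lam[i - nd] if i >= nd else 0.0
    lam[i + 1] = lam[i] + h * drift

lam_delta, lam_T = lam[nd], lam[n]     # lambda(delta) = 1 exactly
```

With constant \(\alpha \) the solution is piecewise polynomial; e.g. on \([2\delta , 3\delta ]\) one has \(\lambda (t) = 1 + \alpha \delta + \alpha (t-2\delta ) + \tfrac{1}{2}\alpha ^2 (t-2\delta )^2\), which the scheme reproduces to \(O(h)\).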

We can now proceed along the same lines as in Sect. 5.2 of [1]: Maximizing H with respect to c gives the first order condition

$$\begin{aligned} c(t) = \frac{\lambda (t)}{X(t)p(t)}. \end{aligned}$$
(66)

The solution of the linear BSDE (64) is given by

$$\begin{aligned} \Gamma (t)p(t)= E[\int _t^T \frac{\lambda (s)\Gamma (s)}{X(s)}ds | \mathscr {F}_t], \end{aligned}$$
(67)

where

$$\begin{aligned} {\left\{ \begin{array}{ll} d\Gamma (t) = \Gamma (t^-)[(\mu (t)-c(t))dt +\sigma (t) dB(t) + \int _{\mathbb {R}} \gamma (t,\zeta )\tilde{N}(dt,d\zeta )] \; ; \; 0 \le t \le T \\ \Gamma (0)=1. \end{array}\right. } \end{aligned}$$
(68)

Comparing with (60) we see that

$$\begin{aligned} X(t) = x \Gamma (t) \; ; \; 0 \le t \le T. \end{aligned}$$
(69)

Substituting this into (67) we obtain

$$\begin{aligned} p(t)X(t)= E[\int _t^T \lambda (s) ds | \mathscr {F}_t] \; ; \; 0 \le t \le T. \end{aligned}$$
(70)

Substituting this into (66) we get the following conclusion:

Theorem 5

The optimal relative consumption rate \(c^*(t)\) for the predictive recursive utility consumption problem (62) is given by

$$\begin{aligned} c^*(t) = \frac{\lambda (t)}{E[\int _t^T \lambda (s) ds | \mathscr {F}_t]}\; ; \; 0 \le t < T, \end{aligned}$$
(71)

where \(\lambda (t)\) is the solution of the delay SDE (65).
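As a sanity check on (71), note that when \(\alpha \) is deterministic, \(\lambda \) is deterministic and the conditional expectation drops out, so \(c^*(t) = \lambda (t)/\int _t^T \lambda (s) ds\). In the degenerate case \(\alpha = 0\) we get \(\lambda \equiv 1\) and hence \(c^*(t) = 1/(T-t)\): the optimal relative consumption rate is inversely proportional to the remaining time horizon. A minimal sketch of this degenerate case (grid values are illustrative):

```python
import numpy as np

# Evaluating (71) on a grid in the degenerate case alpha = 0, where
# lambda(t) = 1 and hence c*(t) = 1 / (T - t).
T, h = 1.0, 1e-3
n = round(T / h)
lam = np.ones(n + 1)                   # lambda(t) = 1 when alpha = 0

# Tail integral int_t^T lambda(s) ds on the grid (a Riemann sum,
# exact here because lambda is constant).
tail = h * np.arange(n, -1, -1)

# c*(t_i) = lambda(t_i) / int_{t_i}^T lambda(s) ds, for t_i < T.
c_star = lam[:-1] / tail[:-1]

c0, c_half = c_star[0], c_star[n // 2]  # expected: 1/(T-0) = 1, 1/(T-0.5) = 2
```

For a general deterministic \(\alpha \), the same grid computation applies with `lam` produced by the method-of-steps recursion for (65).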