Abstract
In this paper, we study mean-field type stochastic control problems for systems described by mean-field stochastic differential equations with jump processes, in which the coefficients contain not only the state process but also its marginal distribution. Moreover, the cost functional is also of mean-field type. We derive necessary as well as sufficient conditions of near-optimality for our model, using Ekeland’s variational principle, the spike variation method and some estimates of the state and adjoint processes. Under certain concavity conditions with non-negative derivatives, we prove that the near-maximum condition on the Hamiltonian function in integral form is a sufficient condition for near-optimality. Our result differs from the classical one in that the adjoint equation here is of mean-field type, while the second-order adjoint equation remains the same as in the classical case. As an application, our results are applied to a mean-variance portfolio selection problem, where an explicit expression for the near-optimal portfolio strategy is obtained in state feedback form, involving both the state process and its marginal distribution, via the solutions of Riccati ordinary differential equations.
1 Introduction
We consider a stochastic control problem for systems driven by a nonlinear controlled jump diffusion process of mean-field type, also called a McKean–Vlasov equation, where the coefficients depend on the state of the solution process as well as on its expected value. More precisely, the system under consideration evolves according to the mean-field jump diffusion process
for some functions \(f,\sigma , g\). This mean-field jump diffusion process is obtained as the mean-square limit, as \(n\rightarrow +\infty \), of a system of interacting particles of the form
where \((W^{j}(\cdot ):j\ge 1)\) is a collection of independent Brownian motions. The expected cost to be near-minimized over the class of admissible controls is also of mean-field type, which has the form
It is worth mentioning that the cost functional \(J^{^{s,\zeta }}\) is possibly a nonlinear function of the expected value, which stands in contrast to the standard formulation of a control problem. This leads to a so-called time-inconsistent control problem, for which Bellman dynamic programming does not hold: one cannot apply the law of iterated expectations to the cost functional. The value function is defined as
where the initial time \(s\) and the initial state \(\zeta \) of the system are fixed.
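Before turning to the literature, the interacting particle approximation above can be sketched numerically (this is our own illustration, not part of the paper's analysis): an Euler scheme is applied to the particle system, with the expectation \(\mathbb{E}(x(t))\) of the mean-field limit replaced by the empirical mean over the particles. The drift \(f(t,x,y,u)=-(x-y)\), the constant volatility and the constant jump size are illustrative choices, not the paper's coefficients.

```python
import random, statistics

def simulate_particles(n=200, T=1.0, steps=100, sigma=0.3,
                       lam=1.0, jump=0.1, seed=0):
    """Euler scheme for an interacting particle system approximating a
    McKean-Vlasov jump diffusion: each particle follows
        dx^i = f(t, x^i, m_n) dt + sigma dW^i + jump dN^i,
    where m_n, the empirical mean over the n particles, replaces the
    expectation E[x(t)] of the mean-field limit.  The drift
    f(t, x, y, u) = -(x - y) and the constant jump size are
    illustrative choices."""
    rng = random.Random(seed)
    dt = T / steps
    x = [rng.gauss(0.0, 1.0) for _ in range(n)]
    for _ in range(steps):
        m = sum(x) / n                                     # empirical mean
        x = [xi
             + (-(xi - m)) * dt                            # mean-field drift
             + sigma * rng.gauss(0.0, dt ** 0.5)           # Brownian part
             + (jump if rng.random() < lam * dt else 0.0)  # Poisson jumps
             for xi in x]
    return x

particles = simulate_particles()
spread = statistics.pstdev(particles)   # particles contract toward the mean
```

As \(n\) grows, the empirical mean tracks the expectation of the McKean–Vlasov limit; this propagation-of-chaos effect is what underlies the mean-square limit described above.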
It is well known that near-optimization is as sensible and important as optimization, for both theory and applications. The concept of near-optimal controls was introduced in the recent work by Zhou [1] for a class of stochastic control problems. Various kinds of near-optimal stochastic control problems have been investigated in [2–8]. In Hafayed et al. [2], the authors extended Zhou’s maximum principle of near-optimality [1] to singular stochastic control. The near-optimal stochastic control problem for systems governed by diffusions with jump processes, with applications to finance, has been investigated by Hafayed et al. [3]. Necessary and sufficient conditions of near-optimality for mean-field singular stochastic control have been studied in Hafayed and Abbas [4]. Necessary and sufficient conditions of near-optimality for singular control of jump diffusion processes have been investigated in Hafayed and Abbas [5]. Necessary and sufficient conditions of near-optimality for forward-backward stochastic differential equations, with some applications, have been studied in Huang et al. [6]. The near-optimal control problem for recursive stochastic problems has been studied in Hui et al. [7].
Stochastic optimal control problems for jump processes have been investigated by many authors; see for instance [9–17]. The general case, where the control domain is not necessarily convex and the diffusion coefficient depends explicitly on the control variable, was treated via the spike variation method by Tang and Li [9]. Their conditions are described in terms of two adjoint processes, which solve linear classical backward SDEs. A good account of, and an extensive list of references on, stochastic optimal control for jump processes can be found in [13, 18].
Mathematical mean-field problems play an important role in many fields, such as economics, finance, physics, chemistry and stochastic game theory. Many authors have made contributions to mean-field systems and their applications; see for instance [4, 10, 17, 19–25]. The existence and uniqueness of solutions to mean-field backward stochastic differential equations (MF-BSDEs), obtained as a limit approach, have been investigated in Buckdahn et al. [19]. The maximum principle for SDEs of mean-field type was introduced in [20]. Under some convexity assumptions, sufficient conditions for optimality of mean-field type were established by Shi [21]. In Meyer-Brandis et al. [25], a stochastic maximum principle of optimality for systems governed by controlled Itô–Lévy processes of mean-field type was proved using Malliavin calculus. Various local maximum principles of optimality for mean-field stochastic control problems have been derived in [22, 23].
Our main goal in this paper is to establish necessary as well as sufficient conditions of near-optimality for mean-field jump diffusion processes, in which the coefficients depend on the state of the solution process as well as on its expected value. Moreover, the cost functional is also of mean-field type. The proof of our main result is based on some stability results of the state and adjoint processes with respect to the control variable, along with Ekeland’s variational principle [26] and the spike variation method. These necessary and sufficient conditions of near-optimality differ from the classical ones in that the first-order adjoint equation here turns out to be a linear mean-field backward stochastic differential equation, while the second-order adjoint equation remains the same as in the stochastic maximum principle for jump diffusions developed in Tang and Li [9]. The control domain under consideration is not necessarily convex. It is shown that a stochastic optimal control may fail to exist even in simple cases, while near-optimal controls always exist. This justifies the use of near-optimal stochastic controls, which exist under minimal conditions and are sufficient in most practical cases. Moreover, since there are many near-optimal controls, it is possible to select among them ones that are easier to analyze and implement. Finally, for the reader’s convenience, we collect some analytic results used in this paper in the “Appendix”.
The rest of the paper is organized as follows. Section 2 gives a general formulation of the mean-field control problem with jump processes, together with the notation and assumptions used throughout the paper. In Sects. 3 and 4, we derive necessary and sufficient conditions for near-optimality, respectively, which are our main results. An example of this kind of mean-field control problem is given in the last section.
2 Problem formulation and preliminaries
Throughout this paper, we let \((\Omega ,\mathcal {F},\left( \mathcal {F}_{t}\right) _{t\in \left[ 0,T\right] },\mathbb {P})\) be a fixed filtered probability space equipped with a \(\mathbb {P}\)-completed right-continuous filtration, on which a \(d\)-dimensional Brownian motion \(W=\left( W(t)\right) _{t\in \left[ 0,T\right] }\) is defined. Let \(\eta \) be a homogeneous \(\left( \mathcal {F}_{t}\right) \)-Poisson point process independent of \(W\). We denote by \(\widetilde{N}(d\theta , dt)\) the random counting measure induced by \(\eta \), defined on \(\Theta \times \mathbb {R}_{+}\), where \(\Theta \) is a fixed nonempty subset of \(\mathbb {R}^{k}\) with its Borel \(\sigma \)-field \(\mathcal {B}\left( \Theta \right) \). Further, let \( \mu \left( d\theta \right) \) be the local characteristic measure of \(\eta \), i.e. \(\mu \left( d\theta \right) \) is a \(\sigma \)-finite measure on \(\left( \Theta ,\mathcal {B}\left( \Theta \right) \right) \) with \(\mu \left( \Theta \right) <+\infty \). We then define
where \(N\) is a Poisson martingale measure on \(\mathcal {B}\left( \Theta \right) \times \mathcal {B}\left( \mathbb {R}_{+}\right) \) with local characteristics \( \mu \left( d\theta \right) dt.\) We assume that \(\left( \mathcal {F}_{t}\right) _{t\in \left[ 0,T\right] }\) is the \(\mathbb {P}\)-augmentation of the natural filtration \((\mathcal {F}_{t}^{(W,N)})_{t\in \left[ 0,T\right] }\) defined as follows
where \(\mathcal {G}\) denotes the totality of \(\mathbb {P}\)-null sets, and \( \sigma _{1}\vee \sigma _{2}\) denotes the \(\sigma \)-field generated by \(\sigma _{1}\cup \sigma _{2}\).
Basic Notations
We list some notations that will be used throughout this paper.
-
1.
Any element \(x\in \mathbb {R}^{d}\) is identified with a column vector with \(i\)th component \(x_{i}\), and the norm \(|x|= \sum _{i=1}^{d}|x_{i}|.\)
-
2.
The scalar product of any two vectors \(x\) and \(y\) on \( \mathbb {R}^{d}\) is denoted by \(\left\langle x,y\right\rangle \).
-
3.
We denote by \(\mathcal {A}^{*}\) the transpose of any vector or matrix \(\mathcal {A}\).
-
4.
For a set \(\mathcal {B}\), we denote by \(\mathbf{1}_{\mathcal {B}}\) the indicator function of \(\mathcal {B}\), by \(\overline{co} \left( \mathcal {B}\right) \) the closed convex hull of \(\mathcal {B}\), and by \( Sgn(\cdot )\) the sign function.
-
5.
For a scalar function \(\Phi \), we denote by \(\Phi _{x}\) (resp. \(\Phi _{xx}\)) the gradient or Jacobian (resp. the Hessian) of \(\Phi \) with respect to the variable \(x\), and by \(\partial _{x}^{{{}^\circ }}\Phi \) Clarke’s generalized gradient of \(\Phi \) with respect to \(x.\)
-
6.
We denote by \(\mathbb {L}_{\mathcal {F}}^{2}(\left[ s,T \right] , \) \(\mathbb {\mathbb {R}}^{n})\) the Hilbert space of \(\mathcal {F}_{t}\)-adapted processes \(x(\cdot )\) such that \(\mathbb {E}\int _{s}^{T}\left| x(t)\right| ^{2}dt<+\infty \).
-
7.
For convenience, we will use \(\Phi _{x}(t)=\dfrac{ \partial \Phi }{\partial x}(t,x(t),\mathbb {E}(x(t)),u(t))\) and \(\Phi _{xx}(t)=\dfrac{\partial ^{2}\Phi }{\partial x^{2}}(t,x(t), \mathbb {E}(x(t)),u(t)).\)
Basic Assumptions
Throughout this paper we assume the following.
Assumption (H1)
The functions \(f:\left[ s,T \right] \times \mathbb {R}^{n}\times \mathbb {R}^{n}\mathbb {\times } \mathbb {A} \rightarrow \mathbb {R}^{n},\,\sigma :\left[ s,T\right] \times \mathbb {R}^{n}\times \mathbb {R}^{n}\mathbb {\times \mathbb {A}\rightarrow }\mathcal {M}_{n\times d}\left( \mathbb {R}\right) \) and \(\ell :\left[ s,T\right] \times \mathbb {R} ^{n}\times \mathbb {R}^{n}\mathbb {\times \mathbb {A}}\rightarrow \mathbb {R}\) are measurable in \((t,x,y,u)\) and twice continuously differentiable in \((x,y),\, g:\left[ s,T\right] \times \mathbb {R}^{n}\mathbb {\times }\mathbb {A\times } \Theta \rightarrow \mathbb {R}^{n\times m}\) is twice continuously differentiable in \(x\), and there exists a constant \(C>0\) such that, for \( \varphi =f,\sigma ,\ell :\)
Assumption (H2)
The function \(h:\mathbb {R}^{n}\times \mathbb {R}^{n}\rightarrow \mathbb {R}\) is twice continuously differentiable in \((x,y)\), and there exists a constant \(C>0\) such that
Under the above assumptions, the SDE (1) has a unique strong solution \(x^{u}(t)\), which is given by
and by standard arguments it is easy to show that for any \(q>0\), it holds that
where \(C\left( q\right) \) is a constant depending only on \(q\) and the functional \(J^{s,\zeta }\) is well defined.
We introduce the adjoint equations as follows. The first-order adjoint equation turns out to be a linear mean-field backward SDE, while the second-order adjoint equation remains the same as in Tang and Li [9].
Definition 2.1
(Adjoint equation for mean-field jump diffusion processes) For any \(u(\cdot )\in \mathcal {U}\) and the corresponding state trajectory \(x(\cdot )\), we define the first-order adjoint process \((\Psi (\cdot ),K(\cdot ),\gamma (\cdot ))\) and the second-order adjoint process \((Q(\cdot ),R(\cdot ),\Gamma (\cdot ))\) as the ones satisfying the following equations:
-
(1)
First-order adjoint equation: linear Backward SDE of mean-field type with jump processes
$$\begin{aligned} \left\{ \begin{array}{l} -\,d\Psi (t)=\left\{ f_{x}^{*}\left( t,x(t),\mathbb {E}(x(t)),u(t)\right) \Psi (t)\right. \\ +\,\mathbb {E}\left[ f_{y}^{*}\left( t,x(t),\mathbb {E}(x(t)),u(t)\right) \Psi (t)\right] \\ +\,\sigma _{x}^{*}\left( t,x(t),\mathbb {E}(x(t)),u(t)\right) K(t) \\ +\,\mathbb {E}\left[ \sigma _{y}^{*}\left( t,x(t),\mathbb {E}(x(t)),u(t)\right) K(t)\right] \\ +\,\ell _{x}\left( t,x(t),\mathbb {E}(x(t)),u(t)\right) +\mathbb {E}\left[ \ell _{y}\left( t,x(t),\mathbb {E}(x(t)),u(t)\right) \right] \\ +\,\left. \int _{\Theta }g_{x}^{*}\left( t,x(t^{-}),u(t),\theta \right) \gamma _{t}(\theta )\mu (d\theta )\right\} dt\\ -\,K(t)dW(t)-\int _{\Theta }\gamma _{t}(\theta )N(dt,d\theta )\\ \Psi (T)=h_{x}\left( x(T),\mathbb {E}(x(T))\right) +\mathbb {E}\left[ h_{y}\left( x(T),\mathbb {E}(x(T))\right) \right] . \end{array} \right. \nonumber \\ \end{aligned}$$(9) -
(2)
Second-order adjoint equation: classical linear Backward SDE with jump processes
$$\begin{aligned} \left\{ \begin{array}{l} -\,dQ(t)=\left\{ f_{x}^{*}\left( t,x(t),\mathbb {E}(x(t)),u(t)\right) Q(t)\right. \\ +\,Q(t)f_{x}\left( t,x(t),\mathbb {E}(x(t)),u(t)\right) \\ +\,\sigma _{x}^{*}\left( t,x(t),\mathbb {E}(x(t)),u(t)\right) Q(t)\sigma _{x}\left( t,x(t),\mathbb {E}(x(t)),u(t)\right) \\ +\,\sigma _{x}^{*}\left( t,x(t),\mathbb {E}(x(t)),u(t)\right) R(t)+R(t)\sigma _{x}(t,x(t),\mathbb {E}(x(t)),u(t))\\ +\,\int _{\Theta }g_{x}^{*}\left( t,x(t^{-}),u(t),\theta \right) \left( \Gamma _{t}(\theta )+Q(t)\right) g_{x}\left( t,x(t^{-}),u(t),\theta \right) \mu (d\theta )\\ +\,\int _{\Theta }\left[ \Gamma _{t}(\theta )g_{x}\left( t,x(t^{-}),u(t),\theta \right) +g_{x}^{*}\left( t,x(t^{-}),u(t),\theta \right) \Gamma _{t}(\theta )\right] \mu (d\theta )\\ \left. +\,H_{_{xx}}(t,x(t),\mathbb {E}(x(t)),u(t),\Psi (t),K(t),\gamma _{t}(\theta ))\right\} dt\\ -\,R(t)dW(t)-\int _{\Theta }\Gamma _{t}(\theta )N(dt,d\theta ) \\ Q(T)=h_{xx}\left( x(T),\mathbb {E}(x(T))\right) , \end{array} \right. \nonumber \\ \end{aligned}$$(10)It is well known that, under conditions (H1) and (H2), the first-order adjoint equation (9) admits one and only one \( \mathcal {F}_{t}\)-adapted solution \(\left( \Psi (\cdot ),K(\cdot ), \gamma (\cdot )\right) \in \) \(\mathbb {L}_{\mathcal {F}}^{2}\left( \left[ s,T\right] ;\mathbb {R}^{n}\right) \times \mathbb {L}_{\mathcal {F} }^{2}\left( \left[ s,T\right] ;\mathbb {R}^{n\times d}\right) \times \mathbb {L }_{\mathcal {F}}^{2}\big (\left[ s,T\right] ;\mathbb {R}^{n\times m}\big ) \). This equation reduces to the standard one when the coefficients do not explicitly depend on the expected value (or the marginal law) of the underlying diffusion process.
Also, the second-order adjoint equation (10) admits one and only one \(\mathcal {F}_{t}\)-adapted solution \( \left( Q(\cdot ),R(\cdot ),\Gamma (\cdot )\right) \in \mathbb {L}_{\mathcal {F} }^{2}\left( \left[ s,T\right] ;\mathbb {R}^{n\times n}\right) \) \(\times \mathbb {L}_{\mathcal {F}}^{2}(\left[ s,T\right] ;\left( \mathbb {R}^{n\times n}\right) ^{d})\) \(\times \mathbb {L}_{\mathcal {F}}^{2}\left( \left[ s,T\right] ;\left( \mathbb {R}^{n\times n}\right) ^{m}\right) .\) Moreover, since \( f_{x},f_{y},\sigma _{x},\sigma _{y},\ell _{x},\ell _{y}\) and \(h_{x}\) are bounded by \(C\) under assumptions (H1) and (H2), we have the following estimate
$$\begin{aligned}&\mathbb {E}\left[ \sup _{s\le t\le T}\left| \Psi (t)\right| ^{2}+\int \limits _{s}^{T}\left| K(t)\right| ^{2}dt\right. \nonumber \\&\quad +\int \limits _{s}^{T}\int \limits _{\Theta }\left| \gamma _{t}(\theta )\right| ^{2}\mu (d\theta )dt+\sup _{s\le t\le T}\left| Q(t)\right| ^{2} \nonumber \\&\quad \left. +\int \limits _{s}^{T}\left| R(t)\right| ^{2}dt+\int \limits _{s}^{T}\int \limits _{\Theta }\left| \Gamma _{t}(\theta )\right| ^{2}\mu (d\theta )dt\right] \le C.\nonumber \\ \end{aligned}$$(11)
Definition 2.2
(Usual Hamiltonian and \(\mathcal {H}\)-function). We define the usual Hamiltonian associated with the mean-field stochastic control problem (3)–(4) as follows
where \((t,X,u)\in [s,T]\times \mathbb {R}^{n}\times \mathbb {A}\) and \(X\) is a random variable such that \(X\in \mathbb {L}^{1}(\left[ s,T\right] ; \mathbb {R}^{n})\). Furthermore, we define the \(\mathcal {H}\)-function corresponding to a given admissible pair \(\left( z\left( \cdot \right) , v(\cdot )\right) \) as follows
This shows that
where \(\Psi (t),K(t),\gamma _{t}(\theta )\) and \(Q(t)\) are determined by adjoint equations (9) and (10) corresponding to \(\left( z\left( \cdot \right) , v(\cdot )\right) .\)
Before concluding this section, let us recall the definition of near-optimal controls as given in Zhou ([1], Definitions (2.1)–(2.2)), and Ekeland’s variational principle, which will be used in the sequel.
Definition 2.3
(Near-optimal control of order \(\varepsilon ^{\lambda }\)) For a given \(\varepsilon >0\), the admissible control \(u^{\varepsilon }(\cdot )\) is near-optimal with respect to \(\left( s,\zeta \right) \) if
where \(\mathcal {O}\left( \cdot \right) \) is a function of \(\varepsilon \) satisfying \(\lim _{\varepsilon \rightarrow 0} \mathcal {O}\left( \varepsilon \right) =0.\) The estimator \(\mathcal { O}\left( \varepsilon \right) \) is called an error bound.
-
1.
If \(\mathcal {O}\left( \varepsilon \right) =C\varepsilon ^{\lambda }\) for some \(\lambda >0\) and a constant \(C\) independent of \(\varepsilon \), then \(u^{\varepsilon }(\cdot )\) is called a near-optimal control of order \(\varepsilon ^{\lambda }\).
-
2.
If \(\mathcal {O}\left( \varepsilon \right) =C\varepsilon ,\) the admissible control \(u^{\varepsilon }(\cdot )\) is called \(\varepsilon \)-optimal.
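As a toy deterministic illustration of these orders (our own example, not from the paper), take the cost \(J(u)=u^{2}\), whose infimum over \(\mathbb{R}\) is \(0\): the control \(u^{\varepsilon }=\varepsilon ^{1/2}\) achieves a gap of exactly \(\varepsilon \) and is therefore \(\varepsilon \)-optimal, while \(u^{\varepsilon }=\varepsilon ^{1/4}\) only achieves a gap of \(\varepsilon ^{1/2}\), i.e. it is near-optimal of order \(\varepsilon ^{1/2}\).

```python
def J(u):
    """Toy deterministic cost with infimum 0, attained at u = 0."""
    return u * u

eps = 1e-4

# gap = J(u_eps) - inf J plays the role of the error bound O(eps).
u_eps_optimal = eps ** 0.5       # gap = eps        -> eps-optimal
u_near_optimal = eps ** 0.25     # gap = eps**(1/2) -> order eps^(1/2)

gap_optimal = J(u_eps_optimal)
gap_near = J(u_near_optimal)
```

The same candidate control can thus be near-optimal of one order and fail to be near-optimal of a better order, which is why the exponent \(\lambda \) is part of the definition.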
Lemma 2.1
(Ekeland’s Variational Principle [26]) Let \((F,\,d_{F})\) be a complete metric space and \(f:F\rightarrow \overline{\mathbb {R}}\) be a lower semi-continuous function which is bounded from below. For a given \(\varepsilon >0,\) suppose that \(u^{\varepsilon }\in F\) satisfies \(f\left( u^{\varepsilon }\right) \le \inf _{u\in F}f(u)+\varepsilon \). Then for any \(\delta >0,\) there exists \(u^{\delta }\in F\) such that
-
1.
\(f\left( u^{\delta }\right) \le f\left( u^{\varepsilon }\right) \).
-
2.
\(d_{F}\left( u^{\delta },u^{\varepsilon }\right) \le \delta .\)
-
3.
\(f\left( u^{\delta }\right) \le f\left( u\right) +\dfrac{\varepsilon }{\delta }d_{F}\left( u,u^{\delta }\right) ,\) for all \(u\in F.\)
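The three conclusions of the lemma can be checked by brute force on a finite metric space. The sketch below is our own illustration (the cost \(f(u)=u^{2}\), the grid, \(\varepsilon =0.09\) and \(\delta =\sqrt{\varepsilon }=0.3\) are hypothetical choices; the lemma itself covers arbitrary complete metric spaces): it searches for a point \(u^{\delta }\) satisfying all three properties, starting from the \(\varepsilon \)-minimizer \(u^{\varepsilon }=0.3\).

```python
def ekeland_point(f, points, d, u_eps, eps, delta):
    """Brute-force search, on a finite metric space 'points', for a
    point u_delta satisfying the three conclusions of Ekeland's
    variational principle:
      (1) f(u_delta) <= f(u_eps),
      (2) d(u_delta, u_eps) <= delta,
      (3) f(u_delta) <= f(u) + (eps/delta) * d(u, u_delta) for all u."""
    for v in points:
        if (f(v) <= f(u_eps)
                and d(v, u_eps) <= delta
                and all(f(v) <= f(u) + (eps / delta) * d(u, v)
                        for u in points)):
            return v
    return None

# f(u) = u^2 on a grid of [-1, 1]; u_eps = 0.3 is an eps-minimizer
# with eps = f(0.3) - inf f = 0.09; take delta = sqrt(eps) = 0.3.
grid = [i / 100 for i in range(-100, 101)]
f = lambda u: u * u
d = lambda a, b: abs(a - b)
u_delta = ekeland_point(f, grid, d, 0.3, eps=0.09, delta=0.3)
```

With the classical choice \(\delta =\sqrt{\varepsilon }\), conclusion (3) says \(u^{\delta }\) minimizes the penalized cost \(f(u)+\sqrt{\varepsilon }\,d_{F}(u,u^{\delta })\); this is exactly how the principle is used in Lemma 3.3 below.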
Now, in order to apply Ekeland’s principle to our mean-field control problem, we have to endow the set of admissible controls \(\mathcal {U}\) with an appropriate metric. We define a distance function \(d\) on the space of admissible controls \(\mathcal {U}\) such that \(\left( \mathcal { U},d\right) \) becomes a complete metric space. For any \(u(\cdot )\) and \(v(\cdot )\in \mathcal {U}\) we set
where \(\mathbb {P}\otimes dt\) is the product measure of \(\mathbb {P}\) with the Lebesgue measure \(dt\) on \([s,T]\). Moreover, it has been shown in the book by Yong and Zhou ([27], 146–147) that
-
1.
\(\left( \mathcal {U},d\right) \) is a complete metric space
-
2.
The cost function \(J^{s,\zeta }\) is continuous from \( \mathcal {U}\) into \(\mathbb {R}\).
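This metric, the \(\mathbb {P}\otimes dt\)-measure of the set where the two controls disagree, can be approximated by Monte Carlo. The sketch below is illustrative only: representing \(\omega \) by a single scalar draw per path is our simplification, and the two bang-bang controls are hypothetical.

```python
import random

def control_distance(u, v, T=1.0, n_paths=1000, n_times=100, seed=0):
    """Monte Carlo sketch of
        d(u, v) = (P ⊗ dt){(ω, t) ∈ Ω × [0, T] : u(t, ω) != v(t, ω)},
    approximating P by sampled draws of ω (here a single scalar per
    path, an illustrative simplification) and dt by a uniform grid."""
    rng = random.Random(seed)
    dt = T / n_times
    total = 0.0
    for _ in range(n_paths):
        w = rng.random()                        # one draw of ω
        for k in range(n_times):
            if u(k * dt, w) != v(k * dt, w):
                total += dt                     # contributes to P ⊗ dt
    return total / n_paths

# Two controls that disagree exactly on [0, 1/2), for every ω:
u = lambda t, w: 1.0
v = lambda t, w: 1.0 if t >= 0.5 else 0.0
d_uv = control_distance(u, v)
```

Because \(d\) only measures the set where the controls differ, it is well suited to the spike variations of Sect. 3, whose distance from the unperturbed control is bounded by the length \(\hbar \) of the perturbation window.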
3 Necessary conditions of near-optimality for mean-field jump diffusion processes
In this section, we obtain Zhou-type necessary conditions of near-optimality for systems described by nonlinear controlled jump diffusion processes of mean-field type. The control domain need not be convex (a general action space). The proof follows the general ideas of [1, 9].
The following theorem constitutes the main contribution of this paper.
Let \(\left( \Psi ^{\varepsilon }(\cdot ),K^{\varepsilon }(\cdot ), \gamma ^{\varepsilon }(\cdot )\right) \) and \(\left( Q^{\varepsilon }(\cdot ),R^{\varepsilon }(\cdot ),\Gamma ^{\varepsilon }(\cdot )\right) \) be the solutions of the adjoint equations (9) and (10), respectively, corresponding to \(u^{\varepsilon }(\cdot ).\)
Theorem 3.1
(Mean-field stochastic maximum principle for any near-optimal control) For any \(\delta \in [0, \frac{1}{3})\) there exists a positive constant \(C=C\left( \delta ,\mu (\Theta )\right) \) such that for each \(\varepsilon >0\) and any near-optimal control \(u^{\varepsilon }(\cdot )\) it holds that
Corollary 3.1
Under the assumptions of Theorem 3.1 it holds that
To prove Theorem 3.1 and Corollary 3.1, we need the following auxiliary results on the stability of the state and adjoint processes with respect to the control variable.
In what follows, \(C\) represents a generic constant, which can be different from line to line.
Our first lemma below deals with the continuity of the state processes under the distance \(d.\)
Lemma 3.1
Let \(x^{u}(t)\) and \(x^{v}(t)\) be the solutions of the state equation (1) associated with \(u(t)\) and \(v(t)\), respectively. For any \(\alpha \in (0,1)\) and \(\beta \ge 0\) satisfying \(\alpha \beta <1\), there exists a positive constant \(C=C\left( T,\alpha ,\beta ,\mu (\Theta )\right) \) such that
Proof
Case 1. First, we assume that \(\beta \ge 1\). Using Burkholder–Davis–Gundy inequality for the martingale part and Proposition 6.2 (see “Appendix”) we can compute, for any \(r\ge s:\)
where
and
Now, arguing as in ([1], Lemma 3.1), taking \(b=\frac{1}{ \alpha \beta }>1\) and \(a>1\) such that \(\frac{1}{a}+\frac{1}{b}=1,\) and applying the Cauchy–Schwarz inequality, we get
By using the definition of \(d\) and the linear growth condition on \(f\) with respect to \(x\) and \(y\) (assumption (H1)), we obtain
Similarly, the same inequality holds if \(f\) above is replaced by \(\sigma \) or \(g\); then we get
and
This implies that \(I_{1}\le Cd\left( u(\cdot ),v(\cdot )\right) ^{\alpha \beta }.\)
Since the coefficients \(f,\sigma \) and \(g\) are Lipschitz with respect to \(x\) and \(y\) (assumption (H1)), we conclude that
Hence (16) follows immediately from Gronwall’s inequality.
Case 2. Now we assume \(0\le \beta <1\). Since \(\frac{2}{ \alpha }>1\), the Cauchy–Schwarz inequality yields
This completes the proof of Lemma 3.1. \(\square \)
The next result gives the \(\beta \)-th moment continuity of the solutions to the adjoint equations with respect to the metric \(d.\) This lemma is an extension of Lemma 3.2 in Zhou [1] to mean-field SDEs with jump processes.
Lemma 3.2
For any \(\alpha \in (0,1)\) and \(\beta \in (1,2)\) satisfying \(\left( 1+\alpha \right) \beta <2,\) there exists a positive constant \(C=C\left( \alpha , \beta , \mu (\Theta )\right) \) such that for any \(u(\cdot ), v(\cdot )\in \mathcal {U}\), along with the corresponding trajectories \(x^{u}(\cdot )\), \(x^{v}(\cdot )\) and the solutions \((\Psi ^{u}(\cdot ),K^{u}(\cdot ), \gamma ^{u}(\cdot ),Q^{u}(\cdot ),\) \(R^{u}(\cdot ),\Gamma ^{u}(\cdot ))\) and \(\left( \Psi ^{v}(\cdot ),K^{v}(\cdot ),\gamma ^{v}(\cdot ), Q^{v}(\cdot ),R^{v}(\cdot ),\Gamma ^{v}(\cdot )\right) \) of the corresponding adjoint equations (9)–(10), it holds that
and
Proof
Note that \(\widetilde{\Psi }(t)=\Psi ^{u}(t)-\Psi ^{v}(t),\) \(\widetilde{K}(t)=K^{u}(t)-K^{v}(t)\) and \(\widetilde{\gamma }_{t}(\theta )=\gamma _{t}^{u}(\theta )-\gamma _{t}^{v}(\theta )\) satisfy the following backward SDE:
where the process \(\mathcal {L}(t)\) is given by
Let \(\phi (\cdot )\) be the solution of the following linear SDE
where \(Sgn\left( a\right) \equiv (Sgn(a_{1}),Sgn(a_{2}),\ldots ,Sgn(a_{n}))^{*}\) for any vector \( a=(a_{1},a_{2},\ldots ,a_{n})^{*}.\)
It is worth mentioning that, since \(f_{x},\) \(\sigma _{x}\) and \(g_{x}\) are bounded and by the fact that
then the SDE (21) has a unique strong solution.
Let \(\eta \ge 2\) such that \(\frac{1}{\eta }+\frac{1}{\beta }=1,\) \( \beta \in \left( 1,2\right) \) then we get
Note that the right hand side term of the above inequality is bounded due to (9), then we get
By applying Itô’s formula for jump processes (see Appendix Lemma 6.1) to \(\widetilde{\Psi }(t)\phi (t)\) on \(\left[ s,T\right] \) and taking expectation, we get
Since
and the fact that
then according to (23) we deduce
We proceed to estimate the right-hand side of (24). First, noting that \(\frac{\alpha \beta }{2}<1-\frac{\beta }{2}<1\), by using assumption (H2) and Lemma 3.1 we obtain
Now, to prove inequality (17) it is sufficient to estimate \(\mathbb {E} \int _{s}^{T}\left| \mathcal {L}(t)\right| ^{\beta }dt.\) By repeatedly using the Cauchy–Schwarz inequality and assumption (H2), we can estimate
By using the fact that \(d(u(\cdot ),v(\cdot ))\le 1\) and \(\frac{\alpha \beta }{2}<1-\frac{\beta }{2},\) the first term on the right-hand side of the above inequality is dominated by \(d(u(\cdot ),v(\cdot ))^{\frac{\alpha \beta }{2}}.\) Since \(\frac{\alpha \beta }{2-\beta }<1\), we have from Lemma 3.1 that
then we have
we conclude that
A similar argument shows that
and
Now, by using similar arguments developed above and (9) we get
A similar argument shows that
and
Next, by applying Cauchy–Schwarz inequality, we get
where
and
By using the fact that \(g_{x}\) is bounded, \(d(u(\cdot ),v(\cdot ))\le 1\) and \(\frac{\alpha \beta }{2}<1-\frac{\beta }{2}\), due to (11) we get
Further, since \(\frac{\alpha \beta }{2-\beta }<1\) we conclude from Lemma 3.1 and (11) that
It follows from (33) and (34) that
We conclude from (26)–(35) that
Finally, combining (24)–(25) and (36), the proof of (17) is complete. Similarly one can prove (19). This completes the proof of Lemma 3.2. \(\square \)
Now, let \((\overline{\Psi }^{\varepsilon }(\cdot ),\overline{K} ^{\varepsilon }(\cdot ),\overline{\gamma }^{\varepsilon }(\cdot ))\) and \((\overline{Q}^{\varepsilon }(\cdot ),\overline{R}^{\varepsilon }(\cdot ), \overline{\Gamma }^{\varepsilon }(\cdot ))\) be the solution of adjoint equations (9)–(10) corresponding to \(\left( \overline{ x}^{\varepsilon }(\cdot ),\mathbb {E}\left( \overline{x}^{\varepsilon }(\cdot )\right) , \overline{u}^{\varepsilon }(\cdot )\right) .\)
Lemma 3.3
For any \(\varepsilon >0,\) there exists a near-optimal control \(\overline{u}^{\varepsilon }(\cdot )\) such that for any \(u\in \mathbb {A}\):
Proof
By using Ekeland’s variational principle with \( \lambda =\varepsilon ^{\frac{2}{3}},\) there is an admissible control \( \overline{u}^{\varepsilon }(\cdot )\) such that for any \(u(\cdot )\in \mathcal {U} :\)
and
Notice that, while \(u^{\varepsilon }(\cdot )\) is only near-optimal for the initial cost \(J^{s,\zeta }\) defined in (2), the control \(\overline{u}^{\varepsilon }(\cdot )\) is optimal for the new cost \(J^{s,\zeta ,\varepsilon }\) given by
Therefore we have
Next, we use the spike variation techniques for \(\overline{u}^{\varepsilon }(\cdot )\) to derive the variational inequality as follows. For \(\hbar >0\), we choose a Borel subset \(\mathcal {E}_{\hbar }\subset \left[ s,T\right] \) such that \(\left| \mathcal {E}_{\hbar }\right| =\hbar \), and we consider the control process which is the spike variation of \(\overline{u}^{\varepsilon }(\cdot ):\)
where \(u\) is an arbitrary fixed element of \(\mathbb {A}\). By using the fact that \(J^{s,\zeta ,\varepsilon }\left( \overline{u}^{\varepsilon }(\cdot ) \right) \le J^{s,\zeta ,\varepsilon }(\overline{u}^{\varepsilon ,\hbar }(\cdot ))\) and \(d(\overline{u}^{\varepsilon ,\hbar }(\cdot ),\overline{u} ^{\varepsilon }(\cdot ))\le \hbar ,\) we get
Arguing as in Hafayed and Abbas ([17], Theorem 3.1), the left-hand side of (39) is equal to
where \(\tau (\hbar )\longrightarrow 0\) as \(\hbar \longrightarrow 0.\) Finally, replacing (40) in (39), then dividing inequality (39) by \(\hbar \) and sending \(\hbar \) to zero, the near-maximum condition (37) follows. \(\square \)
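The spike variation used in the proof above admits a direct sketch. The example below is illustrative only: it uses deterministic controls, and our own window \([\tau ,\tau +\hbar )\) stands in for the Borel set \(\mathcal {E}_{\hbar }\). The perturbed control equals a fixed action \(u\) on a set of measure \(\hbar \) and \(\overline{u}^{\varepsilon }(t)\) elsewhere, so that its distance to \(\overline{u}^{\varepsilon }\) is at most \(\hbar \).

```python
def spike_variation(u_bar, u_value, tau, hbar):
    """Needle/spike variation of a control: equal to the fixed action
    u_value on the window E_hbar = [tau, tau + hbar) and to u_bar(t)
    elsewhere; the perturbation set has Lebesgue measure hbar."""
    def u_spiked(t):
        return u_value if tau <= t < tau + hbar else u_bar(t)
    return u_spiked

u_bar = lambda t: 0.0                 # unperturbed (deterministic) control
u_h = spike_variation(u_bar, 1.0, tau=0.4, hbar=0.05)

# The control distance d(u_h, u_bar) is the measure of {t : u_h != u_bar},
# here approximated on a time grid; it is at most hbar = 0.05.
dt = 1e-4
dist = sum(dt for k in range(10000) if u_h(k * dt) != u_bar(k * dt))
```

Letting \(\hbar \rightarrow 0\) in the resulting cost inequality, after dividing by \(\hbar \), is exactly the limiting step that yields the variational inequality (37).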
Proof of Theorem 3.1
First, we derive estimates for terms similar to the left-hand sides of inequalities (34) and (35), with \(\left( \overline{x}^{\varepsilon }(\cdot ), \mathbb {E}(\overline{x}^{\varepsilon }(\cdot )),\overline{u}^{\varepsilon }(\cdot )\right) \), etc., replaced by \((x^{\varepsilon }(\cdot ),\mathbb {E}( x^{\varepsilon }(\cdot )),u^{\varepsilon }(\cdot ))\), etc.
Now, to prove (14) it remains to estimate the following differences
and
Then we have
We estimate the first term on the right-hand side, \(\mathbb {I}_{1}\left( \varepsilon \right) =\mathbb {E}\int \limits _{s}^{T}\left[ \overline{K}^{\varepsilon }(t)\!-\!K^{\varepsilon }(t)\right] [\sigma \left( t,\overline{x}^{\varepsilon }(t),\mathbb {E}(\overline{x}^{\varepsilon }(t)),u\right) -\sigma \left( t, \overline{x}^{\varepsilon }(t),\mathbb {E}(\overline{x}^{\varepsilon }(t)), \overline{u}^{\varepsilon }(t)\right) ].\) Fix \(\delta \in [0, \frac{1}{3})\), so that \(\alpha =3\delta \in [0,1)\), and let \(\beta \) be a fixed real number such that \(1<\beta <2\) and \((1+\alpha )\beta <2\). Taking \(q>2\) such that \(\frac{1}{\beta }+\frac{1}{q}=1\), by using Hölder’s inequality, Lemma 3.2 and (4), we obtain
We now estimate the second term \(\mathbb {I}_{2}\left( \varepsilon \right) .\) By applying the Cauchy–Schwarz inequality, (9), assumption (H1) and Lemma 3.1, we get
Now, let us turn to the third term \(\mathbb {I}_{3}\left( \varepsilon \right) .\) By adding and subtracting \(\sigma (t,\overline{x} ^{\varepsilon }(t),\mathbb {E}(\overline{x}^{\varepsilon }(t)), u^{\varepsilon }(t))\), we have
then by using Cauchy–Schwarz inequality, we have
We proceed as in \(\mathbb {I}_{2}\left( \varepsilon \right) \) to estimate the second term on the right of the above inequality; by applying the Cauchy–Schwarz inequality, assumption (H1) and (9), we obtain
thus, we have proved that
By using similar arguments developed above, we can prove that
Now, let us turn to estimate the third term \(S_{3}( \varepsilon )\). By applying the Cauchy–Schwarz inequality, we get
For any \(\delta \in [0,\frac{1}{3})\), set \(\alpha =3\delta \in [0,1)\), and let \(\beta \) be a fixed real number such that \(\beta \in (1,2)\) and \((1+\alpha )\beta <2\). Taking \(q>2\) such that \(\frac{1}{\beta }+\frac{1}{q}=1\), by Hölder’s inequality, Lemma 3.2 and (5) we obtain
Applying assumption (H3), Cauchy–Schwarz inequality, Lemma 3.2, note (10) and the fact that \(\mu (\Theta )<\infty \) we get
By using (38) we get \(d(\overline{u}^{\varepsilon }(\cdot ),u^{\varepsilon }(\cdot ))^{\alpha }\le \left( \varepsilon ^{\frac{2}{3} }\right) ^{\alpha },\) so it holds that
We proceed to estimate \(\mathbb {J}_{3}(\varepsilon )\). By adding and subtracting \(g\left( t,\overline{x}^{\varepsilon }(t),u^{\varepsilon }(t),\theta \right) \) and applying the Cauchy–Schwarz inequality, we obtain
by applying Cauchy–Schwarz inequality, Lemma 3.2 and (11) it follows that
Thus, we have proved that
The desired result (14) follows immediately by combining (44), (45), (46) and (34). This completes the proof of Theorem 3.1. \(\square \)
Proof of Corollary 3.1
In the spike variation technique, for the perturbed control \(\overline{u}^{\varepsilon , \theta }(\cdot )\) in (37), the point \(u\in \mathbb {A}\) may be replaced by any admissible control \(u(\cdot )\in \mathcal {U},\) and the subsequent argument still goes through. So the inequality in the estimate (15) holds for any \( u(\cdot )\in \mathcal {U}\). \(\square \)
4 Sufficient conditions of near-optimality for mean-field jump diffusion processes
We show in this section that, under certain concavity conditions on the Hamiltonian \(H\) and some convexity conditions on the function \(h(\cdot ,\cdot )\), the \(\varepsilon \)-maximum condition on the Hamiltonian function \(\mathcal {H}\) in integral form is sufficient for near-optimality. We assume:
Assumption (H3) \(\psi \) is differentiable in \(u\) for \(\psi =f,\sigma , \ell , g\), and there is a constant \(C>0\) such that
Now we are able to state and prove the sufficient conditions for near-optimality for systems governed by mean-field SDEs with jump processes, which is the second main result of this paper.
Let \(u^{\varepsilon }(\cdot )\) be an admissible control and let \(\left( \Psi ^{\varepsilon }(\cdot ),K^{\varepsilon }(\cdot ),\gamma ^{\varepsilon }\left( \cdot \right) \right) \), \(\left( Q^{\varepsilon }(\cdot ),R^{\varepsilon }(\cdot ),\Gamma ^{\varepsilon }\left( \cdot \right) \right) \) be the solutions of the adjoint equations (9)–(10) corresponding to \(u^{\varepsilon }(\cdot )\).
Theorem 4.1
(Sufficient conditions for near-optimality of order \(\varepsilon ^{\frac{1}{2}}\)). Let conditions (47)–(49) hold. If for some \(\varepsilon >0\) and for any \(u\left( \cdot \right) \in \mathcal {U}:\)
then \(u^{\varepsilon }(\cdot )\) is a near-optimal control of order \( \varepsilon ^{\frac{1}{2}},\) i.e.,
where \(C\) is a positive constant independent of \(\varepsilon .\)
Corollary 4.1
(Sufficient Conditions for \(\varepsilon \)-optimality) Under the assumptions of Theorem 4.1 a sufficient condition for an admissible control \(u^{\varepsilon }(\cdot )\) to be \(\varepsilon \)-optimal for our mean-field control problem (1)–(2) is
Proof of Theorem 4.1
The key step in the proof is to show that \(H_{u}(t,x^{\varepsilon }(t),\mathbb {E}(x^{\varepsilon }(t)),u^{\varepsilon }(t),\Psi ^{\varepsilon }(t),K^{\varepsilon }(t), \gamma _{t}^{\varepsilon }\left( \theta \right) )\) is small, and to estimate it in terms of \(\varepsilon \). We first fix \(\varepsilon >0\) and define a new metric \(\widehat{d}\) on \(\mathcal {U}\) by setting, for any \(u(\cdot )\) and \(v(\cdot )\in \mathcal {U}:\)
where
Obviously \(\widehat{d}\) is a metric on \(\mathcal {U}\) (note that \(\pounds ^{\varepsilon }(t)>1\)), and it is complete as a weighted \(\mathbb {L}^{1}\)-metric.
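Since \((\mathcal {U},\widehat{d})\) is complete, Ekeland's variational principle applies. For the reader's convenience, we recall its classical statement (this is the standard result of [26], recorded here as a sketch rather than a reproduction of the exact display of Lemma 2.1):

```latex
\textbf{Ekeland's variational principle.} Let $(S,d)$ be a complete metric space and
$F\colon S\to\mathbb{R}\cup\{+\infty\}$ be lower semicontinuous and bounded from below.
If $u_{\varepsilon}\in S$ satisfies $F(u_{\varepsilon})\le\inf_{u\in S}F(u)+\varepsilon$,
then for any $\lambda>0$ there exists $u_{\lambda}\in S$ such that
\begin{align*}
  F(u_{\lambda}) &\le F(u_{\varepsilon}), \qquad d(u_{\lambda},u_{\varepsilon})\le\lambda,\\
  F(u_{\lambda}) &< F(u)+\tfrac{\varepsilon}{\lambda}\,d(u,u_{\lambda})
  \quad\text{for all } u\ne u_{\lambda}.
\end{align*}
```

Taking \(\lambda =\sqrt{\varepsilon }\) is the usual choice in near-optimality arguments, which is how the order \(\varepsilon ^{1/2}\) arises.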
Define a functional \(g\) on \(\mathcal {U}\) as follows
By assumption (47), a simple computation shows that
Now, by Definition 2.2 and assumption (H3), we have
Since \(\sigma \) has linear growth with respect to \(x\) and \(y\), by assumption (47) we get
Similarly, since \(g\) has linear growth with respect to \(x\), by assumption (47) we can prove that
Next, since \(\sigma \) has linear growth with respect to \(x\) and \(y\), we deduce that
and
By combining (52)–(56) we conclude that
which implies that \(g\) is continuous on \(\mathcal {U}\) with respect to \(\widehat{d}\). Now, by (51) and Ekeland's variational principle (Lemma 2.1), there exists \(\overline{u}^{\varepsilon }(\cdot )\in \mathcal {U}\) such that
and
where
The maximum condition (58) implies a pointwise maximum condition: namely, \(\mathbb {P}\)-a.s. and for a.e. \(t\in \left[ s,T\right] \),
Using Proposition 6.1 (Item 3), we have
Since the function \(u\longmapsto \left| u-\overline{u}^{\varepsilon }(t)\right| \) is locally Lipschitz but not differentiable at \(\overline{u}^{\varepsilon }(t)\), Clarke's generalized gradient (see Proposition 6.1 and the example in the Appendix) shows that
By (61) and the fact that the generalized gradient of the sum of two functions is contained in the sum of their generalized gradients (Proposition 6.1, Item 5), we get
By assumption (47), the Hamiltonian \(H\) is differentiable in \(u\); then Proposition 6.1 (Item 4) shows that
Next, the differential inclusion (60) implies that there is
such that
By using assumption (47) we can prove that
hence from (62) and (63), assumption (47) and the fact that \(\left| \tau ^{\varepsilon }(t)\right| \le \sqrt{\varepsilon }\pounds ^{\varepsilon }(t)\), we get
Now, using (49), we obtain for any \(u(\cdot )\in \mathcal {U}\)
Integrating this inequality with respect to \(t\) and taking expectations we obtain from (52) and (64)
On the other hand, by using (48) we get
Noting that \(\Psi ^{\varepsilon }(T)=h_{x}(x^{\varepsilon }(T),\mathbb {E} (x^{\varepsilon }(T)))+\mathbb {E}\left( h_{y}(x^{\varepsilon }(T), \mathbb {E}(x^{\varepsilon }(T)))\right) \), we have
By the integration by parts formula for jump processes applied to \(\Psi ^{\varepsilon }(t)(x(t)-x^{\varepsilon }(t))\) (see Lemma 6.1), we get
With the help of (1) and (9), we obtain
Then, from (49) and (66), we get
Combining (67) and (68) we get
Then, by the definition of \(J^{s,\zeta }\), we conclude
Finally, since \(u(\cdot )\) is arbitrary element of \(\mathcal {U}\), the desired result follows. \(\square \)
5 Application to finance: penalized mean-variance portfolio selection
In this section, we apply our necessary and sufficient conditions of near-optimality to a penalized mean-variance portfolio selection problem and derive the explicit expression of the optimal portfolio selection strategy. Our method is inspired by Zhou ([1], Example 6.1).
Suppose that we have a financial market consisting of two investment possibilities:
Risk-free security (e.g., a bond), whose price \(P_{0}\left( t\right) \) at time \(t\) evolves according to the ordinary differential equation
where \(\rho (\cdot )\) is a bounded deterministic function.
Risky security (e.g. a stock), where the price \( P_{1}\left( t\right) \) at time \(t\) is given by
where \(\varsigma (t)\), \(\sigma _{t}\) and \(\xi _{t}\left( \theta \right) \) are bounded deterministic functions such that \(\varsigma (t)\ne 0\), \(\sigma _{t}\ne 0\) and \(\varsigma (t)>\rho (t)\); as above, \(N(d\theta , dt)\) is a compensated random measure.
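The displayed price equations are not reproduced above; the forms usually assumed in this setting, consistent with the stated coefficients (a sketch, not necessarily the paper's exact display), are:

```latex
dP_{0}(t) = \rho(t)\,P_{0}(t)\,dt, \qquad P_{0}(0)=p_{0}>0,
\qquad
dP_{1}(t) = P_{1}(t^{-})\Big[\varsigma(t)\,dt+\sigma_{t}\,dW(t)
  +\int_{\Theta}\xi_{t}(\theta)\,N(d\theta,dt)\Big],
\quad P_{1}(0)=p_{1}>0.
```

In this geometric form, each jump multiplies \(P_{1}\) by the factor \(1+\xi _{t}(\theta )\), which explains why the assumption \(\xi _{t}(\theta )>-1\) below guarantees \(P_{1}(t)>0\).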
Assumptions. In order to ensure that \(P_{1}\left( t\right) >0\) for all \(t\in \left[ 0,T\right] \) we assume that:
-
1.
\(\xi _{t}\left( \theta \right) >-1\) for any \(\theta \in \Theta .\)
-
2.
The function \(t\rightarrow \int \nolimits _{\Theta }\xi _{t}^{2} \left( \theta \right) \mu (d\theta )\) is locally bounded.
Portfolio and wealth dynamics: A portfolio is a predictable process \(\pi (t)=\left( \pi _{0}(t),\pi _{1}(t)\right) \) giving the number of units held at time \(t\) of the bond and the stock. The corresponding wealth process \(x^{\pi }(t),\) \(t\ge 0\) is then given by
The portfolio \(\pi (\cdot )\) is called self-financing if
We denote by
the amount invested in the risky security. Now, combining (71) and (72) together with (73), we obtain the wealth dynamics as follows
where \(\zeta \in \mathbb {R}\). If the corresponding wealth process \(x^{v}(\cdot )\) given by SDE (74) is square integrable, the control variable \(v(\cdot )\) is called tame. We denote by \(\mathcal {U}\) the set of admissible portfolios valued in \(\mathbb {A=R}\).
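The wealth dynamics can be illustrated with a short Euler simulation. The concrete SDE coefficients below are an assumption (the display of (74) is not reproduced above): we take \(dx=[\rho (t)x+(\varsigma (t)-\rho (t))v(t)]dt+\sigma _{t}v(t)dW(t)\) plus a compensated jump term, which matches the prose description of the market. The sanity check uses the fact that with \(v\equiv 0\) the wealth reduces to the deterministic bond ODE.

```python
import math
import random

def simulate_wealth(zeta, T, n_steps, v, rho, varsigma, sigma, xi, jump_rate, rng):
    """Euler scheme for the (assumed) wealth dynamics of SDE (74):
        dx = [rho(t) x + (varsigma(t) - rho(t)) v(t)] dt
             + sigma(t) v(t) dW(t) + v(t) * xi * dN~(t),
    where dN~ is a compensated Poisson increment with intensity `jump_rate`."""
    dt = T / n_steps
    x = zeta
    for k in range(n_steps):
        t = k * dt
        dW = rng.gauss(0.0, math.sqrt(dt))
        # one jump with probability jump_rate*dt, then subtract the compensator
        dN_tilde = (1.0 if rng.random() < jump_rate * dt else 0.0) - jump_rate * dt
        vt = v(t, x)
        x += (rho(t) * x + (varsigma(t) - rho(t)) * vt) * dt \
             + sigma(t) * vt * dW + vt * xi * dN_tilde
    return x

rng = random.Random(0)
# sanity check: with v == 0 the wealth reduces to the deterministic bond ODE,
# so x(T) is close to zeta * exp(integral of rho over [0, T])
x_T = simulate_wealth(1.0, 1.0, 2000, lambda t, x: 0.0,
                      lambda t: 0.03, lambda t: 0.07, lambda t: 0.2,
                      xi=0.1, jump_rate=2.0, rng=rng)
```

With a non-zero portfolio `v`, the same scheme produces random wealth paths driven by both the Brownian and the compensated jump noise.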
Mean-variance portfolio selection. We consider a family of optimization problems parameterized by a small parameter \(\varepsilon >0\), which may represent the complexity of the cost functional
subject to \(x^{v}(T)\) solution of SDE-(74) at time \(T\) given by
where \(L(\cdot )\) is a nonlinear, convex and bounded function, satisfying assumption (47) and independent of \(\varepsilon .\)
Inspired by (Zhou [1], Example 6.1), our objective is to find an admissible portfolio \(v^{*}(\cdot )\) which minimizes the cost function (75) of mean-field type (i.e., with \(\ell \equiv \frac{\varepsilon ^{2}}{4}L(v(t))\), \(s=0\), \(h\left( x(t),\mathbb {E}(x(t))\right) = \left( x(t)- \mathbb {E}(x(t))-\frac{\varepsilon }{2}\right) ^{2}\)). Solving problem (74)–(75), called \(\mathcal {P}_{\varepsilon }\), explicitly may be difficult. The idea is to show that we can easily obtain a near-optimal control (in feedback form) analytically, based on the optimal control of the simpler problem \(\mathcal {P}_{0}\), which is obtained by setting \(\varepsilon =0\) in (75); then we get
We study the optimal control problem in which the state is governed by SDE (74) with the new cost function (76). In a second step, we solve the control problem (74)–(76) and obtain an optimal solution explicitly. Finally, inspired by Zhou ([1], Example 6.1), we solve the control problem \(\mathcal {P}_{\varepsilon }\) near-optimally.
Problem \(\mathcal {P}_{0}\): (Optimal solution of the mean-field stochastic control problem (74)–(76)). By a standard argument, problem \(\mathcal {P}_{0}\) can be solved as follows.
Since \(f\left( t,x(t),\mathbb {E}(x(t)),v(t)\right) =\rho (t)x(t) +(\varsigma (t)-\rho (t))v(t)\), \(\sigma \left( t,x(t),\mathbb {E}(x(t)),v(t)\right) =\sigma _{t}v(t)\), \(g\left( t,x(t),v(t),\theta \right) =v(t)\xi _{t}\left( \theta \right) \), the Hamiltonian \(H\) takes the form
Consequently, since this expression is linear in \(v(\cdot )\), it is clear that the supremum is attained at \(v^{*}(t)\) satisfying
Since \(h_{x}\left( x(T),\mathbb {E}(x(T))\right) =2\left( x(T)- \mathbb {E}(x(T))\right) \) and \(h_{y}(x(T), \mathbb {E}(x(T)))=-2(x(T)-\mathbb {E}(x(T)))\), a simple computation shows that the first-order adjoint equation (9) associated with \(v^{*}(t)\) takes the form
In order to solve the above Eq. (78) and to find the expression of \(v^{*}(t)\) we conjecture a process \(\Psi ^{*}(t)\) of the form
where \(\Phi _{1}(\cdot ),\Phi _{2}(\cdot )\) and \(\Phi _{3}(\cdot )\) are deterministic differentiable functions (see [4, 12, 15, 22] for other conjectured forms).
Applying Itô's formula to (79), by virtue of SDE (74), we get
Next, comparing (80) with (78), we get
and
Combining (82) and (84) together with (77) we get
We denote
By using (77) together with (85) and (86), we get
Now combining (81) with (79) we deduce
By comparing the terms containing \(x^{*}(t)\) and \(\mathbb {E}\left( x^{*}(t)\right) \), we obtain from (87) and (88) the two ordinary differential equations (ODEs for short):
A simple computation from (89) gives
Since \(\Phi _{1}(T)=2,\Phi _{2}(T)=-2,\) (see (84)) we deduce
Let us turn to calculate explicitly \(\Phi _{1}(t)\) and \(\Phi _{2}(t)\). By dividing the first ODE in (89) by \(\Phi _{1}(t)\) and the second ODE by \(\Phi _{2}(t)\) we get
We now solve the above ODEs (see the book by Boyce and DiPrima [28], Chapter 2). Simple computations show that, for any \(t\in \left[ 0,T\right] \),
With this choice of \(\Phi _{1}(t)\) and \(\Phi _{2}(t)\), we conclude that \(v^{*}(t)\) is given by
and the adjoint processes
satisfying the adjoint equation (9). Moreover, with this choice of \(v^{*}(t)\), the maximum condition (14) of Theorem 3.1 holds. Since \(h\left( x(t),\mathbb {E}x(t)\right) =\left( x(t)-\mathbb {E} x(t)\right) ^{2}\) is convex and \(H\left( \cdot ,\cdot , \cdot , \Psi (t),K(t), \gamma _{t}(\theta )\right) \) is concave, we can assert that our admissible portfolio \(v^{*}(t)\) is optimal, and the sufficient conditions in Theorem 4.1 are satisfied, with \(v^{*}(t)\) achieving the maximum. Finally, we give the explicit optimal portfolio in state feedback form in the following theorem.
Theorem 5.1
The optimal solution of our mean-field stochastic control problem \(\mathcal {P}_{0}\) is given in the state feedback form by
where \(A(t)\) is given by (86).
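The computation of \(\Phi _{1}\) and \(\Phi _{2}\) above reduces to linear ODEs with terminal data \(\Phi _{1}(T)=2\), \(\Phi _{2}(T)=-2\), whose solutions take the exponential form \(\Phi (t)=\Phi (T)\exp \left( \int _{t}^{T}a(s)\,ds\right) \). This representation can be checked numerically; the coefficient \(a(s)\) below is hypothetical, since the display of (89) is not reproduced here.

```python
import math

def phi_backward_euler(a, phi_T, t0, T, n):
    """Integrate phi'(s) = -a(s) * phi(s) backwards from phi(T) = phi_T:
    stepping from T down to t0 with step h, phi(s-h) ~ phi(s) + h*a(s)*phi(s)."""
    h = (T - t0) / n
    phi, s = phi_T, T
    for _ in range(n):
        phi += h * a(s) * phi
        s -= h
    return phi

def phi_closed_form(a, phi_T, t0, T, n=200000):
    """Closed form phi(t0) = phi_T * exp(integral of a over [t0, T]),
    with the exponent computed by the trapezoidal rule."""
    h = (T - t0) / n
    integral = sum(0.5 * (a(t0 + i * h) + a(t0 + (i + 1) * h)) * h for i in range(n))
    return phi_T * math.exp(integral)

# hypothetical coefficient a(s); in (89) it would be built from rho(t) and A(t)
a = lambda s: 0.05 + 0.01 * s
phi1_num = phi_backward_euler(a, 2.0, 0.0, 1.0, 20000)  # terminal value Phi_1(T) = 2
phi1_exact = phi_closed_form(a, 2.0, 0.0, 1.0)          # = 2 * exp(0.055)
```

The backward Euler value agrees with the closed-form exponential to first order in the step size, which is the mechanism behind the explicit formulas for \(\Phi _{1}(t)\) and \(\Phi _{2}(t)\).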
Problem \(\mathcal {P}_{\varepsilon }\): The Hamiltonian function \(\mathcal {H}\) for the problem \(\mathcal {P}_{0}\) is
where \(Q^{*}(\cdot )\) is given by second-order adjoint equation
By uniqueness of the solution of the above classical backward SDE, it is easy to show that
then we get
Since \(v^{*}(\cdot )\) is optimal, by the stochastic maximum principle it is necessary that \(v^{*}(\cdot )\) maximizes the \(\mathcal {H}\)-function \(a.s.\), namely,
The Hamiltonian \(\mathcal {H}_{\varepsilon }\) for the problem \(\mathcal {P}_{\varepsilon }\) is
The above function is maximized at \(v^{\varepsilon }(t)\) which satisfies
By applying (96), we have
Combining (96) and (97), we can show that
since
by a simple computation we get
Using (98), (47) and the fact that \(L\left( \cdot \right) \) is convex and bounded, we obtain
Moreover, by using (96), the Hamiltonian \(H_{\varepsilon }\) of problem \(\mathcal {P}_{\varepsilon }\) is
Since \(L(\cdot )\) is convex, the Hamiltonian \(H_{\varepsilon }\left( t,\cdot , \cdot ,\cdot , \Psi (t), K(t),\gamma _{t}(\theta )\right) \) is concave. By applying Theorem 4.1, this proves that the control \(v^{*}(t)\) given by (94) is indeed near-optimal for the stochastic control problem \(\mathcal {P}_{\varepsilon }\).
Concluding remarks In this paper, necessary and sufficient conditions of near-optimal stochastic control for systems governed by mean-field jump diffusion processes are established. The control variable is allowed to enter both the diffusion and jump coefficients, and the coefficients depend on the state of the solution process as well as on its expected value. Moreover, the cost functional is also of mean-field type. Our results are applied to a financial optimization problem, where the explicit expression of the optimal (and near-optimal) portfolio is obtained in state feedback form. If \(\varepsilon =0\), Theorem 3.1 reduces to the stochastic maximum principle of optimality developed in Hafayed and Abbas ([17], Theorem 3.1).
Moreover, if \(\varepsilon =0\) and the coefficients \(f\), \(\sigma \) of the underlying jump diffusion process and the cost functional do not explicitly depend on the expected value, Theorem 3.1 reduces to the necessary conditions of optimality developed in Tang and Li ([9], Theorem 2.1), and Theorem 4.1 reduces to the sufficient conditions of optimality developed in Framstad et al. ([12], Theorem 2.1).
References
Zhou XY (1998) Stochastic near-optimal controls: necessary and sufficient conditions for near-optimality. SIAM J Control Optim 36(3):929–947
Hafayed M, Abbas S, Veverka P (2013) On necessary and sufficient conditions for near-optimal singular stochastic controls. Optim Lett 7(5):949–966
Hafayed M, Veverka P, Abbas S (2012) On maximum principle of near-optimality for diffusions with jumps, with application to consumption-investment problem. Differ Equ Dyn Syst 20(2):111–125
Hafayed M, Abbas S (2013) On near-optimal mean-field stochastic singular controls: necessary and sufficient conditions for near-optimality. J Optim Theory Appl. doi:10.1007/s10957-013-0361-1
Hafayed M, Abbas S (2013) Stochastic near-optimal singular controls for jump diffusions: necessary and sufficient conditions. J Dyn Control Syst. doi:10.1007/s10883-013-9191-6
Huang J, Li X, Wang G (2010) Near-optimal control problems for linear forward-backward stochastic systems. Automatica 46(2):397–404
Hui E, Huang J, Li X, Wang G (2011) Near-optimal control for stochastic recursive problems. Syst Control Lett 60:161–168
Chighoub F, Mezerdi B (2011) Optimality conditions in stochastic control of jump diffusion processes. Syst Control Lett 60:907–916
Tang SJ, Li XJ (1994) Necessary conditions for optimal control of stochastic systems with random jumps. SIAM J Control Optim 32(5):1447–1475
Hafayed M (2013) A mean-field maximum principle for optimal control of forward-backward stochastic differential equations with Poisson jump processes. Int J Dyn Control. doi:10.1007/s40435-013-0027-8
Cadenillas A (2002) A stochastic maximum principle for system with jumps, with applications to finance. Syst Control Lett 47:433–444
Framstad NC, Øksendal B, Sulem A (2004) Sufficient stochastic maximum principle for the optimal control of jump diffusions and applications to finance. J Optim Theory Appl 121:77–98
Øksendal B, Sulem A (2007) Applied stochastic control of jump diffusions, 2nd edn. Springer, Berlin
Rishel R (1975) A minimum principle for controlled jump processes. Lecture notes in economics and mathematical systems, vol 107. Springer, Berlin, pp 493–508
Shi J, Wu Z (2010) Maximum principle for Forward-backward stochastic control system with random jumps and application to finance. J Syst Sci Complex 23:219–231
Shi J, Wu Z (2011) A stochastic maximum principle for optimal control of jump diffusions and application to finance. Chin J Appl Prob Stat 27(2)
Hafayed M, Abbas S (2013) A general maximum principle for stochastic differential equations of mean-field type with jump processes. Technical report. arXiv:1301.7327v4
Shi J, Wu Z (2006) The Maximum principle for fully coupled Forward-backward stochastic control system. Acta Autom Sinica 32(2):161–169
Buckdahn R, Djehiche B, Li J, Peng S (2009) Mean-field backward stochastic differential equations: a limit approach. Ann Prob 37(4):1524–1565
Buckdahn R, Djehiche B, Li J (2011) A general stochastic maximum principle for SDEs of mean-field type. Appl Math Optim 64:197–216
Shi J (2012) Sufficient conditions of optimality for mean-field stochastic control problems. In: 12th international conference on control, automation, robotics and vision Guangzhou, China, 5–7th December
Li J (2012) Stochastic maximum principle in the Mean-field controls. Automatica 48:366–373
Andersson D, Djehiche B (2011) A maximum principle for SDEs of mean-field type. Appl Math Optim 63:341–356
Shen Y, Siu TK (2013) The maximum principle for a jump-diffusion mean-field model and its application to the mean-variance problem. Nonlinear Anal 86:58–73
Meyer-Brandis T, Øksendal B, Zhou XY (2012) A mean-field stochastic maximum principle via Malliavin calculus. Stoch Int J Prob Stoch Proc 84(5–6):643–666
Ekeland I (1974) On the variational principle. J Math Anal Appl 47:324–353
Yong J, Zhou XY (1999) Stochastic controls, Hamiltonian systems and HJB equations. Springer, New York
Boyce WE, DiPrima RC (2000) Elementary differential equations and boundary value problems, 7th edn. Wiley, New York
Clarke FH (1983) Optimization and nonsmooth analysis. Wiley, New York
Bouchard B, Elie R (2008) Discrete time approximation of decoupled Forward-Backward SDE with jumps. Stoch Process Appl 118(1):53–75
Acknowledgments
The authors would like to thank the editor and anonymous referees for their constructive corrections and valuable suggestions that improved the manuscript. The first author was partially supported by Algerian PNR project grant 08-u07-857, ATRST-ANDRU 2011-2013.
Appendix
The following result gives the definition and some basic properties of the Clarke’s generalized gradient.
Definition 6.1
Let \(F\) be a convex set in \(\mathbb {R}^{n}\) and let \(f:F\rightarrow \mathbb {R}\) be a locally Lipschitz function. The generalized gradient of \(f\) at \(\widehat{x}\in F\), denoted by \(\partial _{x}^{{{}^\circ }}f\left( \widehat{x}\right) \), is a set defined by
where \(f^{\circ }\left( \widehat{x},\upsilon \right) =\limsup _{y\rightarrow \widehat{x},\,t\downarrow 0}\frac{1}{t}\left( f\left( y+t\upsilon \right) -f\left( y\right) \right) .\)
Proposition 6.1
If \(f:\mathbb {R}^{n}\rightarrow \mathbb {R}\) is locally Lipschitz at \(x\in \mathbb {R}^{n}\), then the following statements hold:
-
1.
\(\partial _{x}^{{{}^\circ }}f\left( x\right) \) is nonempty, compact and convex set in \(\mathbb {R}^{n}\).
-
2.
\(\partial _{x}^{{{}^\circ }}\left( -f\right) \left( x\right) =-\partial _{x}^{{{}^\circ }}\left( f\right) \left( x\right) \).
-
3.
\(\partial _{x}^{{{}^\circ }}f\left( x\right) \ni 0\) if \(f\) attains a local minimum or maximum at \(x\).
-
4.
If \(f\) is continuously differentiable at \(x\), then \(\partial _{x}^{{{}^\circ }}f\left( x\right) =\left\{ f^{\prime }\left( x\right) \right\} .\)
-
5.
If \(f,\) \(g:\mathbb {R}^{n}\rightarrow \mathbb {R}\) are locally Lipschitz functions at \(x\in \mathbb {R}^{n}\), then \(\partial _{x}^{ {{}^\circ }}\left( f+g\right) \left( x\right) \subset \partial _{x}^{ {{}^\circ } }f\left( x\right) +\partial _{x}^{ {{}^\circ }}g\left( x\right) .\)
For the detailed proof of the above proposition, see Clarke [29] or the book by Yong and Zhou ([27], Lemma 2.3).
As a simple example of the generalized gradient, we consider the absolute value function \(f:x\mapsto \left| x-a\right| \) which is continuously differentiable everywhere except at \(x=a\). Since \(f^{\prime }\left( x\right) =1\) for \(x>a\) and \(f^{\prime }\left( x\right) =-1\) for \(x<a\), then a simple calculation shows that the generalized gradient of \(f\) at \( x=a\) is given by \(\partial _{x}^{ {{}^\circ } }f\left( a\right) =\overline{co}\left\{ -1,1\right\} =\left[ -1,1\right] \).
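The absolute-value example above can be verified numerically: the one-sided difference quotients on either side of the kink recover the limits \(-1\) and \(+1\), whose convex hull is the generalized gradient \([-1,1]\). This is a small illustrative check, not part of the paper's argument.

```python
# f(x) = |x - a| has one-sided derivatives -1 (left of a) and +1 (right of a);
# Clarke's generalized gradient at x = a is their convex hull [-1, 1].
a = 0.7
f = lambda x: abs(x - a)

h = 1e-8
slope_right = (f(a + 1e-4 + h) - f(a + 1e-4)) / h  # derivative just right of a
slope_left = (f(a - 1e-4 + h) - f(a - 1e-4)) / h   # derivative just left of a

clarke_gradient = sorted([slope_left, slope_right])  # endpoints of [-1, 1]
```

In particular \(0\in [-1,1]\), which is consistent with Item 3 of Proposition 6.1: \(f\) attains its minimum at \(x=a\), so \(0\) belongs to the generalized gradient there.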
The following result gives a special case of the Itô formula for jump diffusions.
Lemma 6.1
(Integration by parts formula for jump processes) Suppose that the processes \(x_{1}(t)\) and \(x_{2}(t)\) are given, for \(j=1,2\) and \(t\in \left[ s,T\right] \), by:
Then we get
See Framstad et al. ([12], Lemma 2.1) for the detailed proof of the above lemma.
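For reference, writing \(N\) for the Poisson random measure and \(\widetilde{N}(d\theta ,dt)=N(d\theta ,dt)-\mu (d\theta )dt\) for its compensated version, and assuming dynamics \(dx_{j}(t)=f_{j}(t)\,dt+\sigma _{j}(t)\,dW(t)+\int _{\Theta }g_{j}(t,\theta )\,\widetilde{N}(d\theta ,dt)\), the classical product rule for jump diffusions reads (a sketch of the standard identity; the paper's display may use slightly different notation):

```latex
d\big(x_{1}(t)x_{2}(t)\big)
 = x_{1}(t^{-})\,dx_{2}(t)+x_{2}(t^{-})\,dx_{1}(t)
 + \sigma_{1}(t)\sigma_{2}(t)\,dt
 + \int_{\Theta} g_{1}(t,\theta)\,g_{2}(t,\theta)\,N(d\theta,dt).
```

The last two terms are the quadratic covariation: the continuous part contributes \(\sigma _{1}\sigma _{2}\,dt\), while simultaneous jumps contribute the product \(g_{1}g_{2}\) integrated against the (uncompensated) measure \(N\).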
Proposition 6.2
Let \(\mathcal {G}\) be the predictable \(\sigma \)-field on \(\Omega \times \left[ s,T\right] \), and \(f\) be a \(\mathcal {G}\times \mathcal {B}(\Theta )\)-measurable function such that
then for all \(\beta \ge 2\) there exists a positive constant \(C=C(T,\beta , \mu (\Theta ))\) such that
See Bouchard and Elie ([30], Appendix).
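The estimate referred to in Proposition 6.2 is a Burkholder–Davis–Gundy-type (Kunita) inequality for Poisson stochastic integrals. A standard form, consistent with the stated constant \(C=C(T,\beta ,\mu (\Theta ))\) and recorded here as the expected statement since the display is not reproduced (with \(\widetilde{N}\) the compensated measure), is:

```latex
\mathbb{E}\Big[\sup_{s\le r\le T}\Big|\int_{s}^{r}\!\!\int_{\Theta}
  f_{t}(\theta)\,\widetilde{N}(d\theta,dt)\Big|^{\beta}\Big]
 \le C\,\mathbb{E}\bigg[\Big(\int_{s}^{T}\!\!\int_{\Theta}
  |f_{t}(\theta)|^{2}\,\mu(d\theta)\,dt\Big)^{\beta/2}
 + \int_{s}^{T}\!\!\int_{\Theta}|f_{t}(\theta)|^{\beta}\,\mu(d\theta)\,dt\bigg].
```

For \(\beta =2\) the second term is absorbed by the first and the bound reduces to the usual Itô isometry-type estimate, which is the case used in the moment estimates of Lemma 3.2.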
Hafayed, M., Abba, A. & Abbas, S. On mean-field stochastic maximum principle for near-optimal controls for Poisson jump diffusion with applications. Int. J. Dynam. Control 2, 262–284 (2014). https://doi.org/10.1007/s40435-013-0040-y
Keywords
- Stochastic control
- Controlled mean-field jump diffusion processes
- Near-optimization
- Necessary and sufficient conditions
- Ekeland’s principle
- McKean–Vlasov system
- Time-inconsistent solution
- Feedback control