1 Introduction

The theory of Mean Field Games (MFGs, henceforth) began with the pioneering works of Lasry and Lions [17] and Huang et al. [13] to describe the asymptotic organization among a large population of optimizing individuals interacting with each other in a mean-field way and subject to constraints of economic or energetic type. The mean-field interaction makes it possible to reduce the analysis to a control problem for one single representative player, interacting with, and evolving in, the environment created by the aggregation of the other individuals. Intuitively, the system’s symmetries will force the players to obey a form of law of large numbers and satisfy a propagation of chaos phenomenon as the size of the population grows. The literature on MFGs is rapidly growing, and the application of MFG theory is catching on in areas as diverse as Economics, Biology, Physics, and Machine Learning; hence, it is impossible to give an exhaustive account of the activity on the topic. For this reason, we refer the reader to the lecture notes by Cardaliaguet [3] and the two-volume monograph by Carmona et al. [6] for a comprehensive presentation of MFG theory and its applications; the first reference presents the theory from an analytic perspective, whereas the second one does so from a probabilistic point of view.

However, in many practical situations (e.g., in evacuation planning and crowd management at mass gatherings), it stands to reason that a single person interacts only with the few people in her/his immediate surroundings, i.e., each individual has her/his own personal space. A possible mathematical way to describe this type of interaction is through an appropriate rescaling of a given reference function V, where V is a sufficiently regular probability density function; see, e.g., Oelschläger [21] and Morale et al. [19]. Denoting by x and y the positions of two individuals (out of a population of N) in a d-dimensional space, their interaction can be modelled by:

$$\begin{aligned} N^{-1} V^N(x-y), \end{aligned}$$

where

$$\begin{aligned} V^N(z) = N^{\beta } V(N^{\beta /d}z). \end{aligned}$$
(1.1)
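To fix ideas, the scaling in Eq. (1.1) can be illustrated numerically. The sketch below (in Python, with a hypothetical choice of V: the one-dimensional tent kernel, a compactly supported probability density) checks that \(V^{N}\) keeps unit mass while its peak grows like \(N^{\beta }\) and its support shrinks like \(N^{-\beta /d}\); the value \(\beta = 0.4\) is an arbitrary illustrative choice.

```python
import numpy as np

def V(z):
    # hypothetical reference kernel: the tent density V(z) = max(1 - |z|, 0),
    # a compactly supported probability density on R (d = 1)
    return np.maximum(1.0 - np.abs(z), 0.0)

def V_N(z, N, beta=0.4, d=1):
    # moderate rescaling of Eq. (1.1): V^N(z) = N^beta * V(N^{beta/d} z)
    return N**beta * V(N**(beta / d) * z)

z = np.linspace(-2.0, 2.0, 400001)
dz = z[1] - z[0]
mass = {N: V_N(z, N).sum() * dz for N in (10, 1000)}        # stays ~ 1 for every N
peak = {N: V_N(np.array([0.0]), N)[0] for N in (10, 1000)}  # grows like N^0.4
```

As N grows, the interaction range \(N^{-\beta /d}\) shrinks to zero while the total mass of \(V^{N}\) remains one, which is precisely the moderate-interaction regime.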

The parameter \(\beta \in (0,1)\) describes how V is rescaled with the total number N of individuals and expresses the so-called moderate interaction among the individuals; see Oelschläger [21]. On the other hand, \(\beta = 0\) expresses an interaction of mean-field type, whereas \(\beta = 1\) generates the so-called nearest-neighbour interaction. This paper aims to analyze the asymptotic organization among many optimizing individuals moderately interacting with each other. To the best of our knowledge, the study of this type of asymptotic organization has been performed only in Aurell and Djehiche [1] and Cardaliaguet [4]. In the former work, the authors introduce models for crowd motion in a more simplified setting: they account for the moderate interaction among the individuals in the cost functional only, although the position of each pedestrian (in a crowd of N pedestrians) belongs to \(\mathbb {R}^{d}\). In Cardaliaguet [4], too, only the payoff of a player depends, in an increasingly singular way, on the players which are very close to her/him; in addition, to avoid issues related to boundary conditions or problems at infinity, in the latter work the data are assumed periodic in space. The facts that the data are assumed periodic in space and (above all) that the moderate interaction enters only the cost functional have consequences for the proof of existence and uniqueness of solutions of the Partial Differential Equation (PDE) MFG system associated with our model; see the discussion below in this introduction and Sect. 4.

The model  The motion of a single player \(X^{N,i}_t\), \(t \in [0, T]\), in a population of N individuals is assumed to be modelled as

$$\begin{aligned} \begin{aligned} X_t^{N, i}&= {X_0}^{N,i} + \int _{0}^{t}\Bigg (\alpha ^{N,i}(s) + b\Big (X_s^{N,i}, \frac{1}{N} \sum _{j = 1}^{N} V^{N}(X_s^{N,i}-X_s^{N, j})\Big )\Bigg )\,ds \\&\quad \,\, + W_t^{N, i},\quad t \in [0, T],\quad i \in \left\{ 1,\ldots ,N \right\} . \end{aligned} \end{aligned}$$
(1.2)

Here, \(\varvec{\alpha }^{N} \doteq (\alpha ^{N,1},\ldots , \alpha ^{N,N})\) is a vector of strategies that we will specify below, b is a given deterministic function and \(W^{N,1}, \ldots , W^{N,N}\) are independent d-dimensional Wiener processes defined on some filtered probability space \((\Omega , \mathcal {F}, (\mathcal {F}_t)_{t \in [0,T]}, \mathbb {P})\). We will denote by \(\mathbf {X}^{N}_{{t}} \doteq (X^{N,1}_{{t}}, \ldots , X^{N, N}_{{t}})\) the vector of the positions at time t of the N individuals. In addition, \({X_0}^{N,i}\), \(i = 1, \ldots , N\), are \(\mathbb {R}^{d}\)-valued independent and identically distributed (i.i.d.) random variables, independent of the Wiener processes, such that \({X_0}^{N,i}\overset{d}{\sim } \xi \) (where \(``\overset{d}{\sim }"\) stands for “distributed as"), \(\xi \) being an auxiliary random variable with law \(\mu _0\) having density \(p_0\), i.e. \(\mu _0\) is absolutely continuous with respect to (w.r.t.) the Lebesgue measure. Eq. (1.2) says that each individual i partially controls her/his velocity through the strategy \(\alpha ^{N, i}\). However, the velocity also depends on her/his position and on the positions of the other individuals in a neighbourhood of \(X^{N,i}\). Indeed, the functions \(V^{N}(\,\cdot \,)\) (see Eq. (1.1)) are mollifiers (see Appendix A for a precise definition) describing the intermediate regime between the mean-field and the nearest-neighbour interaction. For large N they have a relatively small support, and therefore individual i interacts, via the term \(V^{N}(X_s^{N,i}-X_s^{N,j})\), only with the few players, indexed by j, in a neighbourhood of \(X_s^{N,i}\). In particular, the rate at which the support of \(V^{N}\) shrinks will be such that the number of players interacting with a given player i is still very large in the limit as N tends to infinity, yet very small compared to the full population size N.
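A minimal Euler–Maruyama simulation may help visualize the dynamics in Eq. (1.2); everything below is an illustrative assumption (dimension d = 1, zero controls \(\alpha ^{N,i} \equiv 0\), a tent-shaped kernel V, and a hypothetical bounded Lipschitz drift b as in hypothesis (H1)), not the paper’s actual model data.

```python
import numpy as np

rng = np.random.default_rng(0)

def V(z):
    # tent kernel: a compactly supported probability density on R (d = 1)
    return np.maximum(1.0 - np.abs(z), 0.0)

def b(x, p):
    # hypothetical bounded, Lipschitz drift b(x, p) as in (H1)
    return -np.tanh(p)

N, T, n_steps, beta = 200, 1.0, 100, 0.4
dt = T / n_steps
X = rng.normal(size=N)  # i.i.d. initial positions with density p_0 (standard Gaussian here)

for _ in range(n_steps):
    # local density seen by player i: (1/N) sum_j V^N(X_i - X_j), with V^N as in Eq. (1.1)
    diffs = X[:, None] - X[None, :]
    local = (N**beta * V(N**beta * diffs)).mean(axis=1)
    alpha = np.zeros(N)  # uncontrolled dynamics for this illustration
    X = X + (alpha + b(X, local)) * dt + np.sqrt(dt) * rng.normal(size=N)
```

Each particle’s drift depends on the empirical local density through the shrinking kernel \(V^{N}\), so only nearby particles effectively interact.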
It is worth mentioning that it is also possible to let a common disturbance affect all the individuals [13], commonly referred to in the MFGs literature as common noise; we refer to the second volume by Carmona et al. [6] for an overview of this theory. The common disturbance could be used—as also pointed out by Aurell and Djehiche [1]—to model an evacuation during, for instance, a fire or an earthquake.

We leave, however, the study of this case for future research.

Each player acts to minimize her/his own expected cost according to a given functional over a finite time horizon [0, T]. More precisely, player i evaluates a strategy vector \(\varvec{\alpha }^{N}\) according to the following cost functional

$$\begin{aligned}&J^{N}_{i}(\varvec{\alpha }^{N}) \doteq \mathbb {E}\left[ \int _{0}^{T}\Bigg (\frac{1}{2}|\alpha ^{N,i}(s)|^2 + f\Big (X_s^{N,i}, \frac{1}{N} \sum _{j = 1}^{N} V^{N}(X_s^{N,i}-X_s^{N, j})\Big )\Bigg )\,ds + g(X_T^{N,i})\right] ,\nonumber \\ \end{aligned}$$
(1.3)

where \(\varvec{X}^{N}_{{t}}\) is the solution of Eq. (1.2) under \(\varvec{\alpha }^{N}\). Notice that the cost coefficients f and g are the same for all players. The cost functional \(J_{i}^{N}(\varvec{\alpha }^{N})\) admits the following practical interpretation; see also Aurell and Djehiche [1]. The first term penalizes the use of energy, while the second penalizes trajectories passing through densely crowded areas. Finally, the terminal cost \(g(\,\cdot \,)\) penalizes deviations from specific target regions. More details on the setting, with all the technical assumptions, will be given in the next sections.
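Under the same illustrative assumptions as before (d = 1, zero controls, a tent kernel V, drift suppressed), the cost in Eq. (1.3) can be estimated along simulated trajectories; the bounded coefficients f and g below are hypothetical stand-ins for the crowding and target penalties, chosen only so that the sketch is self-contained.

```python
import numpy as np

rng = np.random.default_rng(1)

def V(z):
    return np.maximum(1.0 - np.abs(z), 0.0)  # compactly supported kernel (d = 1)

def f(x, p):
    return p / (1.0 + p)  # hypothetical bounded running cost: penalizes crowded areas

def g(x):
    # hypothetical bounded terminal cost: penalizes distance from a target at x = 2
    return (x - 2.0)**2 / (1.0 + (x - 2.0)**2)

N, T, n_steps, beta = 100, 1.0, 50, 0.4
dt = T / n_steps
X = rng.normal(size=N)
running = np.zeros(N)  # running cost accumulated by each player

for _ in range(n_steps):
    diffs = X[:, None] - X[None, :]
    local = (N**beta * V(N**beta * diffs)).mean(axis=1)
    alpha = np.zeros(N)  # zero controls, so only f and g contribute
    running += (0.5 * alpha**2 + f(X, local)) * dt
    X = X + np.sqrt(dt) * rng.normal(size=N)  # drift b suppressed for simplicity

J = running + g(X)  # one realization per player; J_i^N is its expectation over many runs
```

Averaging J over independent runs yields a Monte Carlo estimate of the expectation in Eq. (1.3) for the uncontrolled dynamics.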

For the class of games just introduced, we focus on the construction of approximate Nash equilibria [17] for the game with a finite number of individuals (i.e., for the N-player game) via the solution of the corresponding control problem for one single representative player (i.e., through the solution of the corresponding MFG). Hereafter, we will use the words “intermediate interactions" and “moderate interactions" interchangeably.

Our main contributions are as follows:

  • We introduce the limit model corresponding to the above N-player games as N tends to infinity, namely the MFG of moderate interaction. We give both the PDE formulation of MFGs with moderate interaction and the stochastic formulation; see Definitions 4.1 and 4.7, respectively.

  • We prove that the PDE system (or the equivalent mild formulation; see Lemma 4.2) admits a solution for every time horizon \(T \in (0, \infty )\); see Theorem 4.4. Also, we prove that the same system admits a unique solution for T sufficiently small; see Theorem 4.5.

  • We prove the existence of a solution in the feedback form to the MFG of moderate interaction; see Theorem 4.8.

  • We derive, in the limit as the number of interacting processes in Eq. (1.2) tends to infinity, a law of large numbers for the empirical processes, and we characterize the limit dynamics; see Theorem 5.1.

  • We prove that any feedback solution of the MFG induces a sequence of approximate Nash equilibria for the N-player games with approximation error tending to zero as N tends to infinity; see Theorem 6.1.

The MFG system of PDEs associated with our model takes the form of a backward Hamilton–Jacobi equation coupled with a forward Kolmogorov equation. In particular, it is a second-order MFG system with local coupling, or of local type. Many authors have studied this type of system in recent years; see Lasry and Lions [16, 17], Porretta [23], Gomes et al. [12], Cardaliaguet and Porretta [5]. However, the framework in these works deviates from ours for two main reasons. First, those authors take as state space the d-dimensional torus \(\mathbb {T}^{d}\) rather than the whole space \(\mathbb {R}^{d}\). Second, and most importantly, they do not consider dependence on the local density in the dynamics; see the term \(b(x, p(t,x))\) in the first equation of Eq. (4.1). We prove the existence of solutions of the PDE MFG system for any \(T >0\) via the Brouwer–Schauder fixed point theorem. Instead, we are not able to prove the uniqueness of such solutions under the standard monotonicity assumption for any \(T > 0\), but only for small T via the contraction principle, the difficulty arising precisely from the dependence on the local density in the dynamics.

The proof of the existence of a MFG solution is based on a verification argument. We identify the unique solution of the PDE system of the MFG with moderate interaction with the feedback control solution of the MFG in its stochastic formulation. In our case, the value function of the representative player is not “regular enough", and so, in order to apply Itô’s formula, some work based on standard mollification arguments will be needed; see Appendix 1, Sect. 1.

The proof of Theorem 5.1 on the characterization of the limit dynamics of the empirical processes is one of the main achievements of this work. It provides a version of the celebrated result of Oelschläger [21] on the macroscopic limit of moderately interacting diffusion particles. Unlike us, Oelschläger [21] does not assume the absolute continuity of \(\mu _0\) with respect to the Lebesgue measure; removing this assumption here would be an additional technicality that would not add to the present work’s conceptual advancements. On the other hand, we can show the validity of Theorem 5.1 under a more general assumption on the SDE drift in Eq. (1.2): Oelschläger [21] imposes a stricter Lipschitz condition on the drift (see Eq. (1.5) in his work), which is used to prove the uniqueness of the solution of a certain (deterministic) equation that characterizes the limit dynamics of the empirical processes. We believe that this paper’s assumptions lead to a more comprehensive understanding of the problem at hand. Because it is of independent interest, we devote the entire Sect. 5 to the proof of the propagation of chaos result.

The proof of Theorem 6.1 on approximate Nash equilibria is based on weak convergence arguments and controlled martingale problems, whose use has a longstanding tradition; see, for instance, Funaki [11], Oelschläger [20], Huang et al. [13], as well as Carmona et al. [6], Section 6.1 of the second volume. However, contrary to those works, we have to study the passage to the many-player (particle) limit in the presence of a deviating player, which destroys the prelimit systems’ symmetry. We will use an argument based on relaxed controls.

Structure of the paper   The rest of this paper is organized as follows. Section 2 introduces some terminology and notation and sets the main assumptions on the dynamics and on the cost functionals. Section 3 describes the setting of N-player games with moderate interaction, while Sect. 4 introduces the corresponding MFG. In Sect. 5, one of the main results, namely the derivation of a law of large numbers for the empirical processes, is stated and proved. Section 6 contains the result on the construction of approximate Nash equilibria for the N-player game from a solution of the limit problem. The technical results used in the paper are gathered in the Appendix, including the aforementioned existence and uniqueness result for the PDE system and the proof of the existence of a MFG solution in Appendix 1, and bounds on Hölder-type semi-norms used to prove the results of Sect. 5 in Appendix 1 and Appendix 1.

2 Preliminaries and Assumptions

Let \(d \in \mathbb {N}\) be the dimension of the space of private states and of the noise. We equip the spaces \(\mathbb {R}^{d}\), \(d \in \mathbb {N}\), with the standard Euclidean norm, denoted by \(|\,\cdot \,|\). Throughout, \(T > 0\) denotes the finite time horizon.

For a Polish space \(\mathcal {S}\), we let \(\mathcal {P}(\mathcal {S})\) denote the space of probability measures on \(\mathcal {B}(\mathcal {S})\), the Borel sets of \(\mathcal {S}\). For \(s \in \mathcal {S}\) we let \(\delta _s\) indicate the Dirac measure concentrated at s. If \(\mathcal {P}(\mathcal {S})\) is equipped with the topology of weak convergence of probability measures, then \(\mathcal {P}(\mathcal {S})\) is a Polish space. In particular, \(\text {C}([0,T] ; \mathcal {P}(\mathcal {S}))\) denotes the space of continuous flows of measures.

We set \(\mathcal {X} \doteq \text {C} ([0,T] ; \mathbb {R}^{d})\) and we equip it with the topology of uniform convergence; the space \(\mathcal {X}\) with this topology is a Polish space. Given \(N \in \mathbb {N}\), we will use the usual identification of \(\mathcal {X}^{N} = \times ^{N} \mathcal {X}\) with the space \( \text {C} ([0,T] ; \mathbb {R}^{d\cdot N})\); \(\mathcal {X}^{N}\) is equipped with the topology of uniform convergence. For \(\ell \in \mathbb {R}_{+}\), we denote by \(\text {C}_b^{\ell }(\mathbb {R}^{d} ; \mathbb {R}^{d})\) the set of \(\mathbb {R}^{d}\)-valued functions on \(\mathbb {R}^{d}\) with bounded \(\ell \)-th derivative, and by \(\text {C}_c^{\ell }(\mathbb {R}^{d}; \mathbb {R}^{d})\) the set of \(\mathbb {R}^{d}\)-valued functions on \(\mathbb {R}^{d}\) with compact support and continuous \(\ell \)-th derivative. We will use simply \(\text {C}_b(\mathbb {R}^{d})\), \(\text {C}_b^{\ell }(\mathbb {R}^{d})\) and \(\text {C}_c^{\ell }(\mathbb {R}^{d})\) when the functions are real-valued. Moreover, \(\text {C}^{\ell }([0,T]; \text {C}_b(\mathbb {R}^{d}))\) denotes the space of \(\text {C}_b(\mathbb {R}^{d})\)-valued functions on [0, T] with continuous \(\ell \)-th derivative; analogous definitions hold if \(\text {C}_b(\mathbb {R}^{d})\) is replaced with either \(\text {C}_b^{\ell }(\mathbb {R}^{d})\) or \(\text {C}_c^{\ell }(\mathbb {R}^{d})\).

Similarly, we denote by \(\text {C}([0,T] \times \mathbb {R}^{d}; \mathbb {R}^{d})\) the set of \(\mathbb {R}^{d}\)-valued continuous functions on \([0,T] \times \mathbb {R}^{d}\) and by \(\text {C}^{1, 2}([0,T] \times \mathbb {R}^{d}; \mathbb {R}^{d})\) the set of \(\mathbb {R}^{d}\)-valued continuous functions on \([0,T] \times \mathbb {R}^{d}\) with continuous first (resp. second) derivative with respect to time (resp. space); analogous definitions (cf. the characterizations in the previous paragraph) hold for the spaces \(\text {C}_b^{1, 2}([0,T] \times \mathbb {R}^{d}; \mathbb {R}^{d})\), \(\text {C}_c^{1, 2}([0,T] \times \mathbb {R}^{d}; \mathbb {R}^{d})\). Again, we will use simply \(\text {C}([0,T] \times \mathbb {R}^{d})\), \(\text {C}^{1,2}([0,T] \times \mathbb {R}^{d})\), \(\text {C}_b^{1, 2}([0,T] \times \mathbb {R}^{d})\), \(\text {C}_c^{1, 2}([0,T] \times \mathbb {R}^{d})\) when the functions are real-valued. In particular, notice that \(\text {C}([0,T];\text {C}_b(\mathbb {R}^{d})) \subset \text {C}_b([0,T]\times \mathbb {R}^{d})\).

As usual, \(\nabla \) and \(\Delta \) denote the gradient and the Laplacian operator, respectively. Finally, for the sake of simplicity, we write \(i \in [[N]]\) in place of \(i = 1, \ldots , N\).

Now let

$$\begin{aligned} \begin{aligned}&b\,:\,\mathbb {R}^{d} \times \mathbb {R}_{+} \rightarrow \mathbb {R}^{d},\\&f\,:\,\mathbb {R}^{d} \times \mathbb {R}_{+} \rightarrow \mathbb {R},\quad \quad g\,:\,\mathbb {R}^{d} \rightarrow \mathbb {R}. \end{aligned} \end{aligned}$$

The function b will denote the drift, while f and g will quantify the running and the terminal costs, respectively. Let us make the following assumptions:

  1. (H1)

    b and f are Borel measurable functions, continuous and such that there exist two constants \(C, L > 0\) for which it holds that

    $$\begin{aligned} \begin{aligned}&|b(x, p)| + |f(x, p)| \le C,\\&|b(x,p) - b(y,q)| + |f(x,p) - f(y,q)| \le L(|x-y| + |p-q|) \end{aligned} \end{aligned}$$

    for all \(x, y \in \mathbb {R}^d\), \(p, q \in \mathbb {R}_{+}\).

  2. (H2)

    g is a Borel measurable function such that \(g, \partial _{x_i} g \in \text {C}_b(\mathbb {R}^{d})\), \(i \in [[d]]\).

  3. (H3)

    For each \(N\in \mathbb {N}\), for some \(\beta \in (0, 1/2)\) and some \(V \in \text {C}_{c}^{1}(\mathbb {R}^{d}) \cap \mathcal {P}(\mathbb {R}^{d})\) we have

    $$\begin{aligned} V^{N}(x) \doteq N^{\beta } V(N^{\frac{\beta }{d}} x),\quad x \in \mathbb {R}^{{d}}, \end{aligned}$$
    (2.1)

    where, we remind, \(\text {C}_{c}^{1}(\mathbb {R}^{d})\) is the space of continuous functions on \(\mathbb {R}^d\) with compact support and continuous first derivatives, while \(\mathcal {P}(\mathbb {R}^{d})\) denotes the probability measures on \(\mathbb {R}^d\). In particular, \(\text {C}_{c}^{1}(\mathbb {R}^{d}) \cap \mathcal {P}(\mathbb {R}^{d})\) denotes the set of probability densities with compact support and continuous first derivatives.

  4. (H4)

    The law \(\mu _0 \in \mathcal {P}(\mathbb {R}^{d})\) is absolutely continuous with respect to the Lebesgue measure on \(\mathbb {R}^{d}\) and with density \(p_0 \in \text {C}_{b}(\mathbb {R}^{d})\) satisfying the following condition:

    $$\begin{aligned} \int _{\mathbb {R}^{d}} e^{\lambda |x|} p_0(x)\,dx < \infty \end{aligned}$$

    for all \(\lambda > 0\).
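A quick sanity check, not spelled out in the text: the standard Gaussian density satisfies (H4). Indeed, from \(\lambda |x| \le \lambda ^{2} + |x|^{2}/4\) one gets

$$\begin{aligned} \int _{\mathbb {R}^{d}} e^{\lambda |x|}\, (2\pi )^{-d/2} e^{-|x|^{2}/2}\,dx \le e^{\lambda ^{2}} (2\pi )^{-d/2} \int _{\mathbb {R}^{d}} e^{-|x|^{2}/4}\,dx = 2^{d/2}\, e^{\lambda ^{2}} < \infty \end{aligned}$$

for all \(\lambda > 0\).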

3 N-Player Games

Let \(N \in \mathbb {N}\) be the number of players. Denote by \(X^{N,i}_t\) the private state of player i at time \(t \in [0,T]\). The evolution of the players’ states depends on the strategies they choose and on the initial distribution of states, which we indicate by \(\mu ^{N}_0\) (thus, \(\mu ^{N}_0 \in \mathcal {P}(\mathbb {R}^{N \times d})\)). We assume that \(\mu ^{N}_0\) factorizes as the N-fold product of a common law \(\mu _0\) for which hypothesis (H4) is in force. Here, we consider players using feedback strategies with full state information, i.e. strategies \(\alpha _t^{N,i} = \alpha (t, \varvec{X}_t^{N})\) with \(\alpha \in \text {C}_b([0,T] \times \mathbb {R}^{d\cdot N} ; \mathbb {R}^{d})\) uniformly bounded by some constant \(C>0\); we let \(\mathcal {A}_{C}^{N, 1, fb}\) denote the set of all such individual strategies. A vector \({\varvec{\alpha }^{N}\doteq }(\alpha ^{N,1},\ldots ,\alpha ^{N,N})\) of individual strategies is called a strategy vector or strategy profile. We denote by \(\mathcal {A}_C^{N, fb}\) the set of all vectors \(\varvec{\alpha }^{N}\) of feedback strategies for the N-player game that are uniformly bounded by some constant \(C>0\). Given a vector of N-player feedback strategies \(\varvec{\alpha }^{N}\), consider the system of equations

$$\begin{aligned} \begin{aligned} X_t^{N, i}&= {X_0}^{N, i} + \int _{0}^{t}\Bigg (\alpha (s, {\varvec{X}_s^{N}}) + b\Big (X_s^{N,i}, \frac{1}{N} \sum _{j = 1}^{N} V^{N}(X_s^{N,i}-X_s^{N, j})\Big )\Bigg )\,ds \\&\quad \, + W_t^{N, i},\quad t \in [0, T],\, i \in {[[N]]}, \end{aligned} \end{aligned}$$
(3.1)

where \(\varvec{X}^{N}_{{t}} = (X^{N,1}_{{t}}, \ldots , X^{N,N}_{{t}})\) and \(W^{N,1}, \ldots , W^{N,N}\) are independent Wiener processes defined on some filtered probability space \((\Omega , \mathcal {F}, (\mathcal {F}_t), \mathbb {P})\) satisfying the usual conditions. The initial conditions \(X_0^{N, i}\) are i.i.d. \(\mathcal {F}_0\)-measurable random variables, each with law \(\mu _0 \in \mathcal {P}(\mathbb {R}^{d})\) and independent of the Wiener processes. The functions \(V^{N}(\,\cdot \,)\) are mollifiers (see hypothesis (H3)) through which we obtain the interaction of moderate type among the players. A solution of Eq. (3.1) under \(\varvec{\alpha }^{N}\) with initial distribution \(\mu _0^{N}\) is a triple \(((\Omega , \mathcal {F}, (\mathcal {F}_t), \mathbb {P}), \varvec{W}^{N}, \varvec{X}^{N})\), where \((\Omega , \mathcal {F}, (\mathcal {F}_t), \mathbb {P})\) is a filtered probability space satisfying the usual hypotheses, \(\varvec{W}^{N} = (W^{N,1},\ldots , W^{N,N})\) is a vector of independent d-dimensional \((\mathcal {F}_t)\)-Wiener processes, and \(\varvec{X}^N = (X^{N,1}, \ldots , X^{N,N})\) is a vector of continuous \(\mathbb {R}^{d}\)-valued \((\mathcal {F}_t)\)-adapted processes such that Eq. (3.1) holds \(\mathbb {P}\)-almost surely with strategy vector \(\varvec{\alpha }^N\) and \(\mathbb {P} \circ (\varvec{{X_0}}^{N})^{-1} = \mu _0^{N}\), each \(X_0^{N, i}\), \(i\in [[N]]\), being independent of the Wiener processes. The i-th player evaluates a (feedback) strategy vector \(\varvec{\alpha }^{N}\) according to the cost functional

$$\begin{aligned} J^{N}_{i}(\varvec{\alpha }^{N})\doteq & {} \mathbb {E}\Bigg [ \int _{0}^{T}\Bigg (\frac{1}{2}|\alpha (s, {\varvec{X}_s^{N}})|^2 + f\Big (X_s^{N,i}, \frac{1}{N} \sum _{j = 1}^{N} V^{N}(X_s^{N,i} -X_s^{N, j})\Big )\Bigg )\, ds \nonumber \\&+ g(X_T^{N,i})\Bigg ], \end{aligned}$$
(3.2)

where \(\varvec{X}^{N}_{{t}} = (X^{N,1}_{{t}}, \ldots , X^{N,N}_{{t}})\) and \(((\Omega , \mathcal {F}, (\mathcal {F}_t), \mathbb {P}), \varvec{W}^{N}, \varvec{X}^{N})\) is a solution of Eq. (3.1) under \(\mu _0^N\). The cost functional is well defined thanks to hypothesis (H1).

Given a strategy vector \(\varvec{\alpha }^{N}\in \mathcal {A}^{N, fb}_C\) and an individual strategy \(\beta \in \mathcal {A}^{N, 1, fb}_C\), let \([\varvec{\alpha }^{N, -i}, \beta ]\in \mathcal {A}^{N, fb}_C\) indicate the strategy vector obtained from \(\varvec{\alpha }^N\) by replacing \(\alpha ^{N,i}\), the strategy of player i, with \(\beta \). The classical solution concept in game theory for the optimization of the cost functionals \(J_i^{N}(\varvec{\alpha }^{N})\) in Eq. (3.2) is that of Nash equilibrium. In the case of a large number of players, our goal will be to prove the validity of a weaker equilibrium concept, namely that of \(\varepsilon \)-Nash equilibrium, introduced in the theory of MFGs.

Definition 3.1

(\(\varepsilon \)-Nash equilibria) Let \(\varepsilon \ge 0\). A strategy vector \(\varvec{\alpha }^{N}\) is called an \(\varepsilon \)-Nash equilibrium for the N-player game if for every \(i \in [[N]]\)

$$\begin{aligned} J^{N}_i(\varvec{\alpha }^{N}) \le J^{N}_{i}([\varvec{\alpha }^{N, -i}, \beta ]) + \varepsilon , \end{aligned}$$
(3.3)

for all admissible single player strategies \(\beta \), i.e., strategies that belong to \(\mathcal {A}_{C}^{N, 1, fb}\).

If \(\varvec{\alpha }^{N}\) is an \(\varepsilon \)-Nash equilibrium with \(\varepsilon = 0\), then \(\varvec{\alpha }^{N}\) is called a Nash equilibrium.

In our framework, we consider strategy vectors \(\varvec{\alpha }^N\) belonging to \(\mathcal {A}^{N, fb}_C\), where, later in the work, we will fix the constant C equal to \(K\left( T,b,f,p_{0},g\right) \) defined in Eq. (4.13). We say that a single-player strategy \(\beta \) is admissible (i.e. it is an admissible deviation from equilibrium) for a player \(i\in [[N]]\) if it belongs to \(\mathcal {A}^{N, 1, fb}_C\), where the constant C is intended to be fixed.

4 Mean Field Games

Let \(T>0\) be the finite time horizon and \(b, f, p_0, g\) as in Sect. 2. Let us introduce the PDE approach to MFGs with moderate interaction via the following coupled system of a backward Hamilton–Jacobi–Bellman equation and a Kolmogorov forward equation, called the PDE system:

$$\begin{aligned} {\left\{ \begin{array}{ll} -\partial _{t}u-\frac{1}{2}\Delta u-b(x,p(t,x))\cdot \nabla u+\frac{1}{2}\left| \nabla u\right| ^{2}=f(x,p(t,x)),\quad (t,x)\in [0,T)\times \mathbb {R}^{d},\\ \partial _{t}p-\frac{1}{2}\Delta p+\text {div}{[p(t,x)(-\nabla u(t,x)+b(x,p(t,x)))]}=0,\quad \quad (t,x)\in (0,T]\times \mathbb {R}^{d},\\ p(0,\,\cdot \,)=p_{0}(\,\cdot \,)\quad x\in \mathbb {R}^{d},\quad u(T,\,\cdot \,)=g(\,\cdot \,),\quad \quad \quad \quad \quad \quad \quad \qquad \,\, x\in \mathbb {R}^{d}, \end{array}\right. } \end{aligned}$$
(4.1)

for all \((x,p) \in \mathbb {R}^{d}\times \mathbb {R}_{+}\). Precisely, the first equation of the PDE system is the Hamilton–Jacobi–Bellman equation, with a quadratic cost, for the value function u of the representative player, while the second one is the Kolmogorov forward equation for the density \(p(t,\,\cdot \,)\) of the representative player. As mentioned in the introduction, the PDE MFG system is of local type, with the dependence on the local density p(t, x) appearing both in the dynamics, via the term \(b(x, p(t,x))\), and in the running cost, via the term \(f(x, p(t,x))\). In addition, the state space is \(\mathbb {R}^{d}\).

The notion of solution we consider for the PDE system is the one in Definition 4.1 below, where we let \(\mathcal {A}\) denote the following operator:

$$\begin{aligned} \mathcal {A} \doteq \partial _t - \frac{1}{2}\Delta . \end{aligned}$$
(4.2)

Definition 4.1

(MFG solution, PDE formulation) A weak solution of the PDE system is a pair (u, p) such that:

(i):

u, \(\partial _i u\) and \(p \in \text {C}_b([0,T] \times \mathbb {R}^{d})\) for all \(i \in [[\,d\,]]\);

(ii):

for all \(\varphi , \psi \in \text {C}^{1,2}_{c}([0,T] \times \mathbb {R}^{d})\) and all \(t \in [0, T]\) the following two equations

$$\begin{aligned}&\quad \left\langle u\left( t \right) ,\varphi \left( t\right) \right\rangle - \left\langle g,\varphi \left( T\right) \right\rangle +\int _{t}^{T}\left\langle u\left( s\right) ,\mathcal {A} \varphi \left( s\right) \right\rangle ds \nonumber \\&=\int _{t}^{T}\left\langle b(\,\cdot \,,p(s))\cdot \nabla u\left( s\right) -\frac{1}{2}\left| \nabla u\left( s\right) \right| ^{2}+f(\,\cdot \,,p(s)),\varphi \left( s\right) \right\rangle ds, \qquad \nonumber \\&\quad \left\langle p\left( t\right) ,\psi \left( t\right) \right\rangle -\left\langle p_{0},\psi \left( 0\right) \right\rangle -\int _{0}^{t}\left\langle p\left( s\right) ,\mathcal {A}\psi \left( s\right) \right\rangle ds \end{aligned}$$
(4.3)
$$\begin{aligned}&= \int _{0}^{t}\left\langle {p(s)(-\nabla u(s)+b(\,\cdot \,,p(s))),\nabla }\psi \left( s\right) \right\rangle ds. \end{aligned}$$
(4.4)

hold.

We now show that under the regularity condition (i) in Definition 4.1 the system in Eqs. (4.3)–(4.4) admits an equivalent mild formulation. To this end, let \(G(t, x-y)\) denote the density of \(x + W_t\), where \(W_t\) is a standard Brownian motion, \(t \in [0, T]\) and \(x, y \in \mathbb {R}^{d}\), and introduce the notation \(\mathcal {P}_t\) for the associated semi-group,

$$\begin{aligned} (\mathcal {P}_t h)(x)\doteq \int _{\mathbb {R}^{d}} G(t, x-y) h(y)\,dy, \end{aligned}$$
(4.5)

defined on functions \(h \in \text {C}_b(\mathbb {R}^{d})\). By taking, for all \(t \in [0,T]\), in Eqs. (4.3) and (4.4) the functions \(\varphi (t)\) and \(\psi (t)\) as the function \(y \mapsto G(t, x-y)\), with x a given parameter, one can show the equivalence between the weak formulation in Eqs. (4.3)–(4.4) and the following mild formulation. This is the content of the following lemma.

Lemma 4.2

Let (u, p) be a pair with the regularity of point (i) in Definition 4.1. Then (ii) in the same definition is equivalent to the validity, for all \(t\in \left[ 0,T\right] \), of the following system:

$$\begin{aligned} \begin{aligned}&u\left( t\right) =\mathcal {P}_{T-t}g-\int _{t}^{T}\mathcal {P}_{s-t}\left( b\left( \,\cdot \,,p\left( s\right) \right) \cdot \nabla u\left( s\right) -\frac{1}{2}\left| \nabla u\left( s\right) \right| ^{2}+f(\,\cdot \,,p(s))\right) ds\\ \end{aligned} \end{aligned}$$
(4.6)

and

$$\begin{aligned} \begin{aligned}&p\left( t\right) =\mathcal {P}_{t}p_{0}-\int _{0}^{t}\nabla \mathcal {P}_{t-s}\left( p\left( s\right) \left( \nabla u\left( s\right) -{b(\,\cdot \,,p(s))}\right) \right) ds,\\ \end{aligned} \end{aligned}$$
(4.7)

where in the last integral we understand that

$$\begin{aligned} \left( \nabla \mathcal {P}_{t-s}h\right) \left( x\right) =\int _{\mathbb {R} ^{d}}\nabla _{x}G\left( t-s,x-y\right) h\left( y\right) dy. \end{aligned}$$
(4.8)

A solution of this integral system with the regularity of point (i) in Definition 4.1 is called a mild solution.

Proof

See Appendix 1, Sect. 1, where we give a sketch of the (less classical) proof for the backward equation (4.6). \(\square \)
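As a sanity check on Eq. (4.5), the semigroup \(\mathcal {P}_t\) can be approximated by discretizing the Gaussian kernel \(G(t, z) = (2\pi t)^{-1/2} e^{-z^{2}/(2t)}\), the transition density of \(x + W_t\) in d = 1; for Gaussian data the result is known in closed form, which makes the quadrature easy to validate. This is only an illustrative numerical sketch, not part of the proof.

```python
import numpy as np

def heat_semigroup(t, h, x, y):
    # (P_t h)(x) = ∫ G(t, x - y) h(y) dy, with G(t, ·) the N(0, t) density (Eq. (4.5), d = 1)
    dy = y[1] - y[0]
    z = x[:, None] - y[None, :]
    G = np.exp(-z**2 / (2.0 * t)) / np.sqrt(2.0 * np.pi * t)
    return (G * h(y)[None, :]).sum(axis=1) * dy  # Riemann-sum quadrature in y

x = np.linspace(-5.0, 5.0, 201)
y = np.linspace(-15.0, 15.0, 3001)
h = lambda u: np.exp(-u**2 / 2.0) / np.sqrt(2.0 * np.pi)  # standard Gaussian datum

t = 0.5
approx = heat_semigroup(t, h, x, y)
# convolving N(0, 1) data with the N(0, t) kernel gives the N(0, 1 + t) density
exact = np.exp(-x**2 / (2.0 * (1.0 + t))) / np.sqrt(2.0 * np.pi * (1.0 + t))
```

The quadrature reproduces the exact Gaussian convolution to high accuracy, illustrating the smoothing action of \(\mathcal {P}_t\) that underlies the mild formulation.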

We now prove that there exists a weak solution (u, p) (cf. Definition 4.1) of the PDE MFG system (4.1) for every time horizon \(T \in (0, \infty )\). In order to do so, we use the Hopf–Cole transform for quadratic Hamiltonians (see, e.g., Remark 1.13 in Cardaliaguet and Porretta [5]) and we consider the following auxiliary system

$$\begin{aligned} {\left\{ \begin{array}{ll} \partial _t w + \frac{1}{2} \Delta w + b(x,p(t,x))\cdot \nabla w = w\,f(x,p(t,x)),\quad \quad \,\, (t,x)\in [0,T)\times \mathbb {R}^{d}, \\ \partial _{t}p-\frac{1}{2}\Delta p+\text {div}{\left[ p(t,x)\left( \frac{\nabla w}{w} + b(x, p(t, x))\right) \right] }=0,\quad \quad (t,x)\in (0,T]\times \mathbb {R}^{d}, \\ p(0,\,\cdot \,)=p_{0}(\,\cdot \,)\quad x\in \mathbb {R}^{d},\quad w(T,\,\cdot \,)=\exp (-g(\,\cdot \,)),\quad \quad \quad \quad \quad \quad \,\,\,\, x\in \mathbb {R}^{d}. \end{array}\right. } \end{aligned}$$
(4.9)

Notice that if (w, p) is a weak solution of the previous system such that \(p, w, \partial _i w \in \text {C}_b([0,T]\times \mathbb {R}^{d})\), \(i \in [[d]]\), then \(w(t, x) \ge e^{-(\Vert g \Vert _{\infty } + T \Vert f \Vert _{\infty })}\) by the strong maximum principle. Therefore, the ratio \(\frac{\nabla w}{w}\) belongs to \(\text {C}_b([0,T] \times \mathbb {R}^{d} ; \mathbb {R}^{d})\), with a bound that depends only on the infinity norms of the coefficients; precisely:

$$\begin{aligned} \left\| \frac{\nabla w}{w}\right\| _{\infty } \le C_w(g, f, b, T). \end{aligned}$$
(4.10)

This observation justifies the following definition, analogous to Definition 4.1.

Definition 4.3

(MFG solution, PDE formulation - I) Let \(p_{0} \in \text {C}_{b}\left( \mathbb {R}^{d}\right) \) be a given probability density and let \(g\in \text {C}_{b}\left( \mathbb {R}^{d}\right) \) also be given. A weak solution of the PDE system (4.9) is a pair (w, p) such that \(w, \partial _i w\) and \(p \in \text {C}_{b}\left( \left[ 0,T\right] \times \mathbb {R}^{d}\right) \) for all \(i \in [[ d ]]\), \(w\left( t,x\right) \ge e^{-\left( \left\| g\right\| _{\infty }+T\left\| f\right\| _{\infty }\right) }\), and the system is satisfied in the weak sense as in Definition 4.1.

In particular, the weak formulation in Definition 4.3 is equivalent to the validity, for all \(t \in [0,T]\), of the following system

$$\begin{aligned} \begin{aligned} w\left( t\right) =\mathcal {P}_{T-t}\exp \left( -g\right) -\int _{t}^{T} \mathcal {P}_{s-t}\left( b\left( \cdot ,p\left( s\right) \right) \cdot \nabla w\left( s\right) -w\left( s\right) f\left( \cdot ,p\left( s\right) \right) \right) ds \end{aligned} \end{aligned}$$
(4.11)

and

$$\begin{aligned} \begin{aligned} p\left( t\right) =\mathcal {P}_{t}p_{0}+\int _{0}^{t}\nabla \mathcal {P} _{t-s}\left( p\left( s\right) \left( \frac{\nabla w\left( s\right) }{w\left( s\right) }+b\left( \cdot ,p\left( s\right) \right) \right) \right) ds, \end{aligned} \end{aligned}$$
(4.12)

where the quantity \(\nabla \mathcal {P}_{t-s}\) is defined in Lemma 4.2, Eq. (4.8). The proof of this equivalence is the same as that of Lemma 4.2, and we omit it for the sake of brevity.
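The operator \(\mathcal {P}_t\) appearing in (4.11)–(4.12) is the heat semigroup, which acts by convolution with the Gaussian kernel (cf. Eq. (4.5)). As an illustrative sketch of ours (arbitrary test function and grid, not part of the paper), the following Python snippet discretizes this action in dimension one and checks the semigroup property \(\mathcal {P}_t \mathcal {P}_s = \mathcal {P}_{t+s}\) up to discretization error.

```python
# Illustrative sketch (not from the paper): the operator P_t of the mild
# formulations (4.11)-(4.12) is the heat semigroup, acting on a bounded
# continuous function h by convolution with the Gaussian kernel
#     G(t, x) = exp(-x^2 / (2 t)) / sqrt(2 pi t)      (here d = 1).
# We discretize this convolution on a grid and check the semigroup property
# P_t P_s h ~ P_{t+s} h away from the grid boundary.
import math

L, n = 10.0, 401                               # grid [-L, L] with n points
xs = [-L + 2 * L * i / (n - 1) for i in range(n)]
dx = xs[1] - xs[0]

def G(t, x):
    return math.exp(-x * x / (2.0 * t)) / math.sqrt(2.0 * math.pi * t)

def P(t, h_vals):
    # discretized (P_t h)(x_i) = sum_j G(t, x_i - x_j) h(x_j) dx
    return [sum(G(t, xi - xj) * hj for xj, hj in zip(xs, h_vals)) * dx
            for xi in xs]

h0 = [math.exp(-x * x) for x in xs]            # a bounded continuous test function

one_step = P(0.3, h0)                          # P_{0.3} h
two_steps = P(0.1, P(0.2, h0))                 # P_{0.1} P_{0.2} h

center = range(n // 4, 3 * n // 4)             # compare away from the boundary
err = max(abs(one_step[i] - two_steps[i]) for i in center)
print(err)  # small: discretization error only
```

The same discretized operator, applied iteratively, is the basis of Picard-type schemes for the fixed-point system (4.11)–(4.12).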

To prove global existence of weak solutions, we need the following additional assumption on \(p_0\):

  1. (H5)

    There exists a continuous function \(\rho :\mathbb {R}^{d}\rightarrow \left( 0,\infty \right) \) such that

    $$\begin{aligned} \lim _{\left\| x\right\| \rightarrow \infty }\rho \left( x\right) =0\quad \text {and}\quad p_{0}\left( x\right) \le \rho \left( x\right) \end{aligned}$$

    for all \(x\in \mathbb {R}^{d}\). Moreover \(p_{0}\in \text {C}_{b}^{\alpha }(\mathbb {R}^{d})\) for some \(\alpha >0\) and \(\rho ^{-1}\in \text {C}^{2}\left( \mathbb {R}^{d}\right) \) with \(\left\| \Delta \rho ^{-1}\right\| _{\infty }+\left\| \nabla \rho ^{-1}\right\| _{\infty }<\infty \).
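For a concrete illustration of (H5) (an example of ours, not taken from the paper), in any dimension d one may choose a standard Gaussian initial density together with an envelope decaying like \(\left| x\right| ^{-1}\):

$$\begin{aligned} p_{0}\left( x\right) =\left( 2\pi \right) ^{-d/2}e^{-\left| x\right| ^{2}/2},\qquad \rho \left( x\right) =\frac{1}{1+\sqrt{1+\left| x\right| ^{2}}}. \end{aligned}$$

Indeed, \(p_{0}\in \text {C}_{b}^{\alpha }(\mathbb {R}^{d})\); the map \(r\mapsto \left( 2\pi \right) ^{-d/2}e^{-r^{2}/2}\big (1+\sqrt{1+r^{2}}\big )\) is decreasing on \([0,\infty )\) with value \(2\left( 2\pi \right) ^{-d/2}\le 1\) at \(r=0\), whence \(p_{0}\le \rho \); moreover, \(\rho ^{-1}\left( x\right) =1+\sqrt{1+\left| x\right| ^{2}}\) belongs to \(\text {C}^{2}\left( \mathbb {R}^{d}\right) \) with \(\left| \nabla \rho ^{-1}\right| \le 1\) and \(\left| \Delta \rho ^{-1}\right| \le d\).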

Notice that the latter assumption on \(\rho ^{-1}\) is not restrictive. Indeed, smoothness of \(\rho ^{-1}\) can be obtained by regularization, and the bounds on \(\left\| \Delta \rho ^{-1}\right\| _{\infty }\) and \(\left\| \nabla \rho ^{-1}\right\| _{\infty }\) hold whenever \(\rho \) decays slowly, monotonically and radially, which can always be assumed without loss of generality. We are now ready to prove the existence of a weak solution of the PDE system (4.9); this is the content of the following theorem, whose proof is relatively standard, although some details are – to the best of our knowledge – new, owing to the fact that the state space is \(\mathbb {R}^{d}\) instead of a bounded set.

Theorem 4.4

There exists a weak solution \(\left( w,p\right) \) on \(\left[ 0,T\right] \) of system (4.9). Moreover, the pair

$$\begin{aligned} \left( u,p\right) \doteq \left( -\log w,p\right) \end{aligned}$$

is a weak solution of the system (4.1).

Proof

See Appendix 1, Sect. 1. \(\square \)

Now, we prove that the system (4.1) admits a unique solution for T sufficiently small via the contraction principle; indeed, the following theorem holds.

Theorem 4.5

(Local well posedness) There exists a unique weak (or mild) solution of the MFG system (4.6)–(4.7), for T sufficiently small.

Proof

See Appendix 1, Sect. 1. \(\square \)

Next, let \(T>0\) indicate (as before) the finite time horizon, and let \(b, f, p_0, g\) be as in Sect. 2. If the PDE system in Eq. (4.1) has a unique weak (or mild) solution \((u, p)\), then we denote by \(K(T, b, f, p_0, g)\) the following constant:

$$\begin{aligned} K\left( T,b,f,p_{0},g\right) \doteq \sup _{t\in \left[ 0,T\right] ,x\in \mathbb {R} ^{d}}\left| \nabla u\left( t,x\right) \right| . \end{aligned}$$
(4.13)

4.1 Feedback MFG with Given Density

We began this section by formulating the PDE approach to MFGs of moderate interaction. Here, instead, we introduce the corresponding stochastic formulation (feedback in this subsection, open-loop in the next one).

Let \(K>0\). In order to make precise our definition of (feedback) MFG solution, we introduce the following notation:

(i):

We denote by \(\mathcal {A}^{fb}_K\) the set of feedback controls for the MFG, which is defined as the set of functions \(\alpha \in \text {C}_b([0,T] \times \mathbb {R}^{d} ; \mathbb {R}^{d})\) bounded by K.

(ii):

Next, given the function p as in Definition 4.1 and an admissible control \(\alpha \in \mathcal {A}^{fb}_K\), we consider the equation

$$\begin{aligned} {X_t = X_0 + \int _{0}^{t} (\alpha (s,X_s) + b(X_s, p(s, X_s)))\,ds + W_t,\quad t \in [0, T],} \end{aligned}$$
(4.14)

where \(X_0\) is an \(\mathcal {F}_0\)-measurable random variable distributed as \(\mu _0\), with density \(p_0\), while W is a d-dimensional Wiener process defined on some filtered probability space \((\Omega , \mathcal {F}, (\mathcal {F}_t), \mathbb {P})\).

(iii):

Finally, we consider the following cost functional

$$\begin{aligned} {J(\alpha ) \doteq \mathbb {E}\left[ \int _{0}^{T} \frac{1}{2}|\alpha (s,X_s)|^2 + f(X_s, p(s, X_s))\,ds + g(X_T)\right] } \end{aligned}$$

and we say that \(\alpha ^{*}\in \mathcal {A}^{fb}_{K}\) is an optimal control if it is a minimizer of J over \(\mathcal {A}^{fb}_{K}\), i.e. if \(J(\alpha ^{*}) = \inf _{\alpha \in \mathcal {A}^{fb}_K} J(\alpha )\).
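To make the objects above concrete, the following Python snippet (an illustrative sketch of ours, not part of the paper) discretizes Eq. (4.14) by the Euler–Maruyama scheme and estimates the cost functional J by Monte Carlo; the specific choices of \(b, f, g\), the frozen density p, the control \(\alpha \) and the bound K are all assumptions made for illustration.

```python
# Illustrative sketch (not from the paper): Euler-Maruyama discretization of the
# controlled dynamics (4.14) and a plain Monte Carlo estimate of the cost
# J(alpha). Everything below (d = 1, the frozen density p, the coefficients
# b, f, g, the feedback control alpha and the bound K) is an assumption made
# for illustration only.
import math, random

random.seed(0)
T, n_steps, n_paths, K = 1.0, 100, 2000, 1.0
dt = T / n_steps

p     = lambda t, x: math.exp(-x * x) / math.sqrt(math.pi)  # frozen density of the environment
b     = lambda x, m: 0.5 * math.tanh(m)                     # bounded drift
f     = lambda x, m: m / (1.0 + m)                          # bounded running cost
g     = lambda x: 1.0 / (1.0 + x * x)                       # bounded terminal cost
alpha = lambda t, x: -K * math.tanh(x)                      # feedback control, |alpha| <= K

costs = []
for _ in range(n_paths):
    x = random.gauss(0.0, 1.0)          # X_0 drawn from an illustrative p_0
    running = 0.0
    for k in range(n_steps):
        t = k * dt
        a = alpha(t, x)
        running += (0.5 * a * a + f(x, p(t, x))) * dt       # running part of J
        x += (a + b(x, p(t, x))) * dt + math.sqrt(dt) * random.gauss(0.0, 1.0)
    costs.append(running + g(x))

J = sum(costs) / n_paths                # Monte Carlo estimate of J(alpha)
print(J)
```

Since here \(0 \le f < 1\), \(0 < g \le 1\) and \(|\alpha | \le K\), any such estimate must satisfy \(0 < J \le T(K^{2}/2 + 1) + 1\), which provides a cheap consistency check.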

The notion of solution we will consider in the feedback case is then the following:

Definition 4.6

(MFG solution, stochastic feedback formulation) Let \(T>0\) be the finite time horizon and let \(b, f, p_0, g\) be as in (H1)–(H2) and (H4); see Sect. 2. Then a feedback MFG solution for bound \(K>0\) is a pair \((\alpha ^*,p)\) such that:

(i):

\(p\in C_b([0,T]\times \mathbb {R}^d)\) and \(\alpha ^* \in \mathcal {A}^{fb}_K\);

(ii):

Given \(p\in C_b([0,T]\times \mathbb {R}^d)\), \(\alpha ^* \in \mathcal {A}^{fb}_K\) is an optimal control for the cost functional \(J(\cdot )\) (in the sense of item (iii) above);

(iii):

For any weak solution \((\Omega , \mathcal {F}, (\mathcal {F}_t), \mathbb {P},X,W)\) of Eq. (4.14), \(X_t\) has law \(\mu _t\) with density \(p(t,\cdot )\) for every \(t\in [0,T]\).

Assume that the MFG system in Eq. (4.1) has a unique weak solution (up) and let K be any constant such that

$$\begin{aligned} {K \ge K(T, b, f, p_0, g),} \end{aligned}$$

where \(K(T, b, f, p_0, g)\) is the constant in Eq. (4.13). From an operational point of view, in order to find a (feedback) MFG solution in the sense of Definition 4.6, we look for an optimal control \(\alpha ^*\in \mathcal {A}^{fb}_K\) such that, given \(p\in C_b([0,T]\times \mathbb {R}^d)\) and given any weak solution \((\Omega , \mathcal {F}, (\mathcal {F}_t), \mathbb {P},X^*,W)\) of Eq. (4.14) (controlled by \(\alpha ^*\) and with density p appearing in the drift), the law of \(X^*_t\) has density \(p^*\in C_b([0,T]\times \mathbb {R}^d)\) such that \(p^*\equiv p\).

Given the environment \((\Omega , \mathcal {F}, (\mathcal {F}_t), \mathbb {P},W, p)\), i.e. a filtered probability space with Wiener process W and with a given distribution of players specified by its density function p, where p is as in Definition 4.1, we notice that path-wise uniqueness and existence of a strong solution of Eq. (4.14) are provided by Veretennikov [25]. Then, we define the state of the PDE system in Eq. (4.1) in the given environment with density p to be the unique solution X of Eq. (4.14) in that environment with \(\alpha \doteq -\nabla u\). Nevertheless, we choose to introduce and work with weak solutions in view of the approximation result of Sect. 6, where we exploit weak convergence of the laws of the N-player system and provide a stochastic representation of the limiting dynamics by means of the martingale problem of Stroock and Varadhan [24].

4.2 Open-Loop MFG with Given Density

We now introduce a more general notion of control, that of open-loop control, together with what we mean by a solution of the MFG in open-loop form.

Let \(K>0\). In order to make precise our definition of (open-loop) MFG solution, we introduce the following notation:

(i):

We denote by \(\mathcal {A}_{K}\) the set of admissible open-loop controls for the MFG, which is defined as the set of tuples \((\Omega , \mathcal {F}, (\mathcal {F}_t), \mathbb {P}, X, W, \alpha )\) where \(\alpha = (\alpha (t))_{t \in [0,T]}\) is \(\mathcal {F}_t\)-progressively measurable, continuous and bounded by K a.s. for all \(t\in [0,T]\), while \((\Omega , \mathcal {F}, (\mathcal {F}_t), \mathbb {P}, X, W)\) is a weak solution of

$$\begin{aligned} {X_t = {X_0} + \int _{0}^{t} (\alpha (s) + b(X_s, p(s, X_s)))\,ds + W_t,\quad t \in [0, T]} \end{aligned}$$
(4.15)

where \(X_0 \overset{d}{\sim } \mu _0\), having density \(p_0\), is independent of the \(\mathcal {F}_t\)-Wiener process W. For the sake of brevity, and where no confusion is possible, we will denote a control for the MFG simply by \(\alpha \), in place of the full tuple.

(ii):

We consider the following cost functional

$$\begin{aligned} {J(\alpha ) \doteq \mathbb {E}\left[ \int _{0}^{T} \frac{1}{2}|\alpha (s)|^2 + f(X_s, p(s, X_s))\,ds + g(X_T)\right] } \end{aligned}$$
(4.16)

and we say that \(\alpha ^{*} \doteq (\alpha ^{*}(t))_{t \in [0, T]} \in \mathcal {A}_{K}\) is an optimal control if it is a minimizer of J over \(\mathcal {A}_{K}\), i.e. if \(J(\alpha ^{*}) = \inf _{\alpha \in \mathcal {A}_K} J(\alpha )\).

Thereafter, we will denote by \(\mathbf {OC}\) the just-introduced optimal control problem. The notion of solution we will consider in the open-loop case is then the following:

Definition 4.7

(MFG solution, stochastic open-loop formulation) Let \(T>0\) be the finite time horizon and let \(b, f, p_0, g\) be as in (H1)–(H2) and (H4); see Sect. 2. Then an open-loop MFG solution for bound \(K>0\) is a pair \((\alpha ^*,p)\) such that:

(i):

\(p\in C_b([0,T]\times \mathbb {R}^d)\) and \(\alpha ^* \in \mathcal {A}_K\), \(\alpha ^*\) standing for the full tuple:

$$\begin{aligned}(\Omega , \mathcal {F}, (\mathcal {F}_t), \mathbb {P}, X, W, \alpha ^*);\end{aligned}$$
(ii):

Given \(p\in C_b([0,T]\times \mathbb {R}^d)\), \(\alpha ^* \in \mathcal {A}_K\) is an optimal control for problem \(\mathbf {OC}\) (in the sense of item (ii) above);

(iii):

\((\Omega , \mathcal {F}, (\mathcal {F}_t), \mathbb {P},X,W)\) is a weak solution of Eq. (4.15) such that \(X_t\) has law \(\mu _t\) with density \(p(t,\cdot )\) for every \(t\in [0,T]\).

As for the feedback case, given the environment \((\Omega , \mathcal {F}, (\mathcal {F}_t), \mathbb {P},W, p)\) where p is as in Definition 4.1, given an admissible control \(\alpha \in \mathcal {A}_K\), we notice that path-wise uniqueness and existence of a strong solution of Eq. (4.15) is provided by Veretennikov [25] but we will continue working with weak solutions in view of the approximation result of Sect. 6.

We point out that feedback controls induce stochastic open-loop controls; as a consequence, computing the infimum of \(J(\alpha )\) over the class of stochastic open-loop controls could, in principle, lead to a lower value than performing the same computation over the set of stochastic feedback controls. However, thanks to Proposition 2.6 in El Karoui et al. [10], the two minimization problems are equivalent from the point of view of the value function.

We now state the main result of this section, the Verification Theorem, which provides an optimal control for \(\mathbf {OC}\). In particular, we are going to show that \(\alpha ^{*}\) is an optimal feedback control, namely the optimal strategy to play at time t in a given state x.

Theorem 4.8

(Verification Theorem) Consider the PDE system in Eq. (4.1) and let (up) be a weak (or mild) solution. Consider the optimal control problem \(\mathbf {OC}\) as in Definition 4.7-(iii) and set \(\alpha ^{*}(t) = \alpha ^{*}(t, x) \doteq -\nabla u(t, x)\). Then,

(i):

\(\alpha ^{*}\) is an optimal control for \(\mathbf {OC}\);

(ii):

For any weak solution \((\Omega , \mathcal {F}, (\mathcal {F}_t), \mathbb {P},X^*,W)\) of Eq. (4.15) with \(\alpha (s) = \alpha ^{*}(s, X_s^{*})\), the state \(X^{*}_t\) has law \(\mu ^*_t\) with density \(p(t,\,\cdot \,)\) for every \(t \in [0, T]\).

Proof

Let \(\alpha \in \mathcal {A}_K\) and let \(X^{\alpha } \doteq (X_t^{\alpha })_{t \in [0,T]}\) be the solution of Eq. (4.15) controlled by \(\alpha \). Besides, let \(X_t^{*}\) be as in Theorem 4.8-(ii), i.e.,

$$\begin{aligned} X_t^{*} = {X_0} + \int _{0}^{t} (-\nabla u(s, X_s^{*}) + b(X_s^{*}, p(s, X_s^{*})))\,ds + W_t. \end{aligned}$$

Notice that, thanks to boundedness of the drift, the previous equation admits both a weak solution and, in any given environment \((\Omega , \mathcal {F}, (\mathcal {F}_t), \mathbb {P},W, p)\), a strong solution that is path-wise unique [25].

Proof of (i).   Heuristically, if the function u were of class \(\text {C}^{1,2}([0,T] \times \mathbb {R}^{d})\), then we could apply Itô's formula to \(u(t, X_t^{\alpha })\) and obtain (in expectation)

$$\begin{aligned} \begin{aligned}&\mathbb {E}[g( X_T^{\alpha })]\\&\quad = \mathbb {E}[u(T, X_T^{\alpha })]\\&\quad =\mathbb {E}\left[ u(0, X_0^{\alpha }) + \int _{0}^{T}\left( \alpha (s) \cdot \nabla u(s, X_s^{\alpha }) + \frac{1}{2}| \nabla u(s, X_s^{\alpha })|^2 - f(X_s^{\alpha }, p(s, X_s^{\alpha }))\right) \,ds\right] , \end{aligned} \end{aligned}$$
(4.17)

where we use the fact that the function u satisfies the first equation of the PDE system in Eq. (4.1). Completing the square, \(\alpha (s)\cdot \nabla u + \frac{1}{2}|\nabla u|^{2} = \frac{1}{2}|\alpha (s) + \nabla u|^{2} - \frac{1}{2}|\alpha (s)|^{2} \ge -\frac{1}{2}|\alpha (s)|^{2}\), which implies

$$\begin{aligned} \mathbb {E}[g(X_T^{\alpha })] \ge \mathbb {E}\left[ u(0, X_0^{\alpha }) + \int _{0}^{T}\left( - \frac{1}{2}| \alpha (s) |^2 - f(X_s^{\alpha }, p(s, X_s^{\alpha }))\right) \,ds\right] . \end{aligned}$$

Hence, for any admissible control \(\alpha \), we would have \(J(\alpha ) \ge \mathbb {E}[u(0, X_0^{\alpha })]\). In particular, the above inequality becomes an equality for \(\alpha (s) = \alpha ^{*}(s,X_s^{*}) = -\nabla u(s,X_s^{*})\), i.e. \(J(\alpha ^{*}) = \inf _{\alpha } J(\alpha ) = \mathbb {E}[u(0, X_0^{*})]\). This would prove that \(\alpha ^{*}\) is an optimal control for \(\mathbf {OC}\).

However, the function u is not “regular enough” to apply Itô's formula, and some work is needed to adapt the heuristic argument. Given the technicality of this part, which is based on standard mollification arguments, we move the required computations to Appendix 1, Sect. 1.

Proof of (ii).   Now, let \(\mu ^{*}_t\) be the law of \(X_t^{*}\) and let \(\varphi \in \text {C}_b^{2}(\mathbb {R}^{d})\) be a test function. By Itô's formula,

$$\begin{aligned} \begin{aligned} \varphi (X_t^{*})&= \varphi ({X_0}) + \int _{0}^{t} \nabla \varphi (X_s^{*}) \cdot (-\nabla u(s, X_s^{*}) + b(X_s^{*}, p(s, X_s^{*})))\,ds\\&\quad \,\,+ \int _{0}^{t} \nabla \varphi (X_s^{*})\,dW_s + \frac{1}{2} \int _{0}^{t} \Delta \varphi (X_s^{*})\,ds. \end{aligned} \end{aligned}$$

Hence, taking expectations on both sides, we have

$$\begin{aligned} \begin{aligned} \left\langle \mu _t^{*}, \varphi (\,\cdot \,) \right\rangle&= \left\langle p_0 ,\varphi (\,\cdot \,)\right\rangle + \int _{0}^{t} \left\langle \mu _s^{*}, \nabla \varphi (\,\cdot \,) \cdot (-\nabla u(s, \,\cdot \,) + b(\,\cdot \,,p(s,\,\cdot \,)))\right\rangle \,ds \\&\quad + \frac{1}{2} \int _{0}^{t} \left\langle \mu _s^{*}, \Delta \varphi (\,\cdot \,)\right\rangle \,ds. \end{aligned} \end{aligned}$$

Theorem 4.5 guarantees that this equation has a unique weak (or mild) solution \(\mu _t\) with density \(p(t,\,\cdot \,)\); hence \(\mu \) and \(\mu ^*\) coincide and \(\mu _t^{*}\) has density \(p(t,\,\cdot \,)\) for every \(t \in [0, T]\). This concludes the proof. \(\square \)

5 Moderately Interacting Particles

Let \(N \in \mathbb {N}\) be the number of players and denote by \(X_t^{N,i}\) the private state of player i at time t, \(t \in \left[ 0, T\right] \). In this section, we assume that the evolution of the players’ states is given by Eq. (3.1) and, as mentioned, we consider players using feedback strategies, i.e. \(\alpha ^{N,i}(s) = \alpha (s, {\mathbf {X}}_s^{N})\) with \(\alpha \) sufficiently smooth. In particular, we will assume – with the natural identification – that \(\alpha \in \text {C}_b([0,T] \times \mathbb {R}^{d\cdot N} ; \mathbb {R}^{d})\). Besides, b, \(V^{N}\) and \({X_0}^{N, i}\), \(i \in [[ N ]]\), satisfy hypotheses \(\text {(H1)}\), \(\text {(H3)}\) and \(\text {(H4)}\) of Sect. 2. Before proceeding, notice that the function

$$\begin{aligned} F : [0,T] \times \mathbb {R}^{d\cdot N} \rightarrow \mathbb {R}^{d\cdot N} \end{aligned}$$

defined component-wise as

$$\begin{aligned} F_i(t, x_1, \ldots , x_N) \doteq \alpha (t, x_i) + b\Bigg (x_i, \frac{1}{N}\sum _{j = 1}^{N} V^{N}(x_i - x_j)\Bigg ) \end{aligned}$$
(5.1)

is continuous and bounded. Since the Brownian motion \(\varvec{W}^{N}_t \in \mathbb {R}^{d\cdot N}\) in Eq. (3.1) is non-degenerate, both the existence of a weak solution and the existence of a pathwise unique strong solution in any given environment \(((\Omega _{N}, \mathcal {F}_{N}, (\mathcal {F}_t^{N}), \mathbb {P}^N), \varvec{W}^{N}, V)\), where now in the N-player case the interaction among players is prescribed by V, hold for this system [25]. Let \(S^{N}_t\) be the empirical measure on \(\mathbb {R}^{d}\) of the players’ private states, that is,

$$\begin{aligned} S_t^{N}(B) \doteq \frac{1}{N} \sum _{i = 1}^{N} \delta _{X_t^{N,i}}(B),\quad B \in \mathcal {B}(\mathbb {R}^{d}),\,\,t\in [0,T]. \end{aligned}$$
(5.2)

\(S^N=(S_t^{N})\) is a continuous stochastic process with values in \(\mathcal {P}(\mathbb {R}^{d})\); hence it can be seen as a random variable with values in \(\text {C}([0,T];\mathcal {P}(\mathbb {R}^{d}))\) (for notational simplicity, we do not make the dependence on \(\omega \in \Omega \) explicit in these definitions). Therefore, \(\mathcal {L}(S_{t}^{N})\in \mathcal {P}(\mathcal {P}(\mathbb {R}^{d}))\) and \(\mathcal {L}(S^{N})\in \mathcal {P}(\text {C}([0,T];\mathcal {P}( \mathbb {R}^{d})))\), respectively.
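As an illustrative sketch of ours (not from the paper), the following Python snippet simulates the N-player dynamics with the drift \(F_i\) of Eq. (5.1) and evaluates the first moment of the empirical measure \(S_T^{N}\) of Eq. (5.2); the specific rescaling of \(V^{N}\) and all coefficient choices below are assumptions made for illustration only.

```python
# Illustrative sketch (not from the paper): Euler-Maruyama simulation of the
# N-player system with the moderate-interaction drift F_i of Eq. (5.1) in
# d = 1, together with the empirical measure S_t^N of Eq. (5.2). The rescaling
# V^N(x) = N^beta * V(N^beta * x), beta in (0, 1), and all coefficients are
# assumptions made here for illustration.
import math, random

random.seed(1)
N, T, n_steps, beta = 100, 0.5, 40, 0.4
dt = T / n_steps

def V(x):                                  # reference kernel: triangular density
    return max(0.0, 1.0 - abs(x))

def V_N(x):                                # moderate rescaling (assumed form)
    s = N ** beta
    return s * V(s * x)

alpha = lambda t, x: -math.tanh(x)         # bounded feedback control
b     = lambda x, m: 0.5 * math.tanh(m)    # bounded drift

X = [random.gauss(0.0, 1.0) for _ in range(N)]   # X_0^{N,i} i.i.d. with density p_0
for k in range(n_steps):
    t = k * dt
    drift = []
    for i in range(N):
        m_i = sum(V_N(X[i] - X[j]) for j in range(N)) / N   # (V^N * S_t^N)(X_t^{N,i})
        drift.append(alpha(t, X[i]) + b(X[i], m_i))         # F_i of Eq. (5.1)
    X = [x + d * dt + math.sqrt(dt) * random.gauss(0.0, 1.0)
         for x, d in zip(X, drift)]

# first moment of the empirical measure S_T^N, the quantity controlled in Lemma 5.2
first_moment = sum(abs(x) for x in X) / N
print(first_moment)
```

The drift of every particle is computed against the frozen configuration of the current step, which is the standard explicit scheme for interacting particle systems.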

The main goal of this section is the characterization of the convergence of the laws \((\mathcal {L}(S^{N}))_{N\in \mathbb {N}}\) in \(\mathcal {P}(\text {C}([0,T];\mathcal {P}(\mathbb {R}^{d})))\). This characterization result is the content of Theorem 5.1 here below.

Theorem 5.1

(Moderately interacting particles) [cf. 21, 22] Grant \(\text {(H1)}\) and \(\text {(H3)}-\text {(H4)}\). Let \(\alpha \in \text {C}_b([0,T] \times \mathbb {R}^{d \cdot N} ; \mathbb {R}^{d})\) be given. Then,

(i):

The sequence of laws \((\mathcal {L}(S^{N}))_{N\in \mathbb {N}}\) converges weakly in \(\mathcal {P}(C([0,T];\mathcal {P}(\mathbb {R}^{d})))\) to \(\delta _{\mu }\in \mathcal {P}(C([0,T];\mathcal {P}(\mathbb {R}^{d})))\) for a flow of probability measures \(\mu \in C([0,T];\mathcal {P}(\mathbb {R}^{d}))\); hence also \(S^{N}\) converges in probability to \(\mu \);

(ii):

For each \(t\in [0,T]\), \(\mu _{t}\) is absolutely continuous with respect to the Lebesgue measure on \(\mathbb {R}^{d}\), with density \(p(t,\,\cdot \,)\); the flow of density functions satisfies

$$\begin{aligned} p \in {\text {C}_b([0,T] \times \mathbb {R}^{d})} \end{aligned}$$

and it is the unique solution in this space of the equation

$$\begin{aligned} p\left( t\right) =\mathcal {P}_{t}p_0 +\int _{0}^{t}\nabla \mathcal {P} _{t-s}\left( p\left( s\right) \left( \alpha \left( s\right) +{b(\,\cdot \,,p(s))} \right) \right) ds. \end{aligned}$$
(5.3)

The proof of the previous theorem is divided into four parts. The first one is the tightness of the sequence of laws \((\mathcal {L}(S^{N} ))_{N\in \mathbb {N}}\) in \(\mathcal {P}(\text {C}([0,T];\mathcal {P}(\mathbb {R} ^{d})))\); see Sect. 5.1. The second one is the collection of estimates on \(V^{N}*S_{t}^{N}\); see Sect. 5.2. The third one is the characterization of the limits: all possible limits are random solutions of the deterministic equation in Eq. (5.3), with the required regularity; see Sect. 5.3. The fourth one is the proof of the uniqueness of solutions of this deterministic equation.

5.1 Tightness of the Empirical Measure

On \(\mathcal {P}(\mathbb {R}^{d})\) the weak topology is generated by the following complete metric:

$$\begin{aligned} d_{w}(\mu ,\nu )\doteq \sup _{f\in \text {Lip}_{1}(\mathbb {R}^{d})\cap \text {C}_{b}(\mathbb {R}^{d})}\left( \langle \mu ,f\rangle -\langle \nu ,f\rangle \right) . \end{aligned}$$

We refer to Oelschläger [21], Page 285, and Dudley [8], Theorem 18, for a complete proof of the previous result. Also, we consider the regularized empirical measures

$$\begin{aligned} \left( V^{N}*S_{t}^{N}\right) (x)=\int _{\mathbb {R}^{d}}V^{N} (x-y)S_{t}^{N}(dy). \end{aligned}$$

In particular, these are probability densities, because they are non-negative functions with

$$\begin{aligned} \int _{\mathbb {R}^{d}}\left( V^{N}*S_{t}^{N}\right) (x)dx=\int _{\mathbb {R}^{d}}\left( \int _{\mathbb {R}^{d}}V^{N}(x-y)dx\right) S_{t} ^{N}(dy)=1. \end{aligned}$$

Therefore, we consider the probability measure with density \(V^{N}*S_{t}^{N}\) as a random time-dependent element of \(\mathcal {P}(\mathbb {R}^{d})\) (for each t and a.s. on the probability space). In the next lemma, when we mention the laws \((\mathcal {L}(V^{N}*S^{N}))_{N\in \mathbb {N}}\) on \(\mathcal {P}(C([0,T];\mathcal {P}(\mathbb {R} ^{d})))\), we adopt this interpretation.
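The normalization just computed can also be checked numerically. In the following Python sketch of ours (with an assumed triangular kernel V and an assumed moderate rescaling of \(V^{N}\), neither taken from the paper), we mollify a finite sample and verify that the resulting function integrates to one up to discretization error.

```python
# Illustrative numerical check (not from the paper) that V^N * S_t^N is a
# probability density: we mollify a finite sample (the atoms of an empirical
# measure) with a rescaled kernel and integrate the result over a grid. The
# triangular kernel V and the rescaling V^N(x) = N^beta * V(N^beta * x),
# beta in (0, 1), are assumptions made for illustration.
import math, random

random.seed(2)
N, beta = 100, 0.4
sample = [random.gauss(0.0, 1.0) for _ in range(N)]   # atoms of S^N

def V(x):                      # triangular probability density, support [-1, 1]
    return max(0.0, 1.0 - abs(x))

def V_N(x):                    # moderate rescaling: still integrates to one
    s = N ** beta
    return s * V(s * x)

def mollified(x):              # (V^N * S^N)(x), cf. the displayed integral
    return sum(V_N(x - y) for y in sample) / N

# integrate over a grid large enough to contain the support of V^N * S^N
lo, hi, n_grid = -8.0, 8.0, 4001
dx = (hi - lo) / (n_grid - 1)
mass = sum(mollified(lo + i * dx) for i in range(n_grid)) * dx
print(mass)  # close to 1
```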

Lemma 5.2

(Tightness) The laws \((\mathcal {L}(S^{N}))_{N\in \mathbb {N}}\) are tight in \(\mathcal {P}(\text {C}([0,T];\mathcal {P}(\mathbb {R}^{d})))\). Similarly, the laws \((\mathcal {L}(V^{N}*S^{N}))_{N\in \mathbb {N}}\) are tight in \(\mathcal {P}(\text {C}([0,T];\mathcal {P}(\mathbb {R}^{d})))\).

Proof

Part 1. Recall that the initial conditions \({X_0}^{N, i}\), \(i \in [[N]]\), admit a density \(p_0\) with a finite first moment. Therefore,

$$\begin{aligned} \mathbb {E}\left[ \int _{\mathbb {R}^{d}} |x| S_{0}^{N}(dx)\right] \le C \end{aligned}$$

for some constant \(C>0\), uniformly in \(N \in \mathbb {N}\). To establish tightness in \(\text {C}([0,T];\mathcal {P}(\mathbb {R}^{d}))\), we have to show (see, for instance, Karatzas and Shreve, 1998, Problem 2.4.11) that the following two conditions are satisfied:

(i):

\(\mathbb {E}\left[ \sup _{t\in [0,T]}\int _{\mathbb {R}^{d}}|x|S_{t}^{N}(dx)\right] \le C\), \(t\in [0,T]\),

(ii):

\(\mathbb {E}\left[ d_{w}(S_{t}^{N},S_{s}^{N})^{p}\right] \le C|t-s|^{1+\epsilon }\), \(t,s\in [0,T]\)

for some constants \(C>0\), \(p\ge 2\) and \(\epsilon >0\). In order to verify \( (i) \), we compute

$$\begin{aligned} \int _{\mathbb {R}^{d}}|x|S_{t}^{N}(dx)=\frac{1}{N}\sum _{i=1}^{N}|X_{t}^{N,i}|, \end{aligned}$$

where

$$\begin{aligned} |X_{t}^{N,i}|\le & {} |{X_0}^{N,i}|+\int _{0}^{t}|\alpha \left( s,X_{s}^{N,i}\right) \\&+b\Big (X_{s}^{N,i},\frac{1}{N}\sum _{j=1}^{N}V^{N}(X_{s}^{N,i}-X_{s} ^{N,j})\Big )|ds+|W_{t}^{N,i}|. \end{aligned}$$

Hence,

$$\begin{aligned} \Vert X^{N,i}\Vert _{\infty ,t}\le |{X_0}^{N,i}|+CT+\Vert W^{i}\Vert _{\infty ,t}, \end{aligned}$$

which implies

$$\begin{aligned} \mathbb {E}\left[ \Vert X^{N,i}\Vert _{\infty ,T}\right] \le \mathbb {E}\left[ |{X_0}^{N,i}|\right] +CT+C_{T}^{W}(d), \end{aligned}$$

where we use the boundedness (uniformly in N) of \(\alpha \), b and \(\mathbb {E}\left[ |{X_0}^{N,i}|\right] \); the quantity \(C_{T}^{W}(d)\) only depends on T and d. As regards (ii), instead,

$$\begin{aligned} \begin{aligned} \mathbb {E}\left[ d_{w}(S_{t}^{N},S_{s}^{N})^{p}\right]&\le \mathbb {E}\left[ \sup _{f}\left| \frac{1}{N}\sum _{i=1}^{N}(f(X_{t} ^{N,i})-f(X_{s}^{N,i}))\right| ^{p}\right] \\&\le \mathbb {E}\left[ \sup _{f}\frac{1}{N}\sum _{i=1}^{N}\left| f(X_{t}^{N,i})-f(X_{s}^{N,i})\right| ^{p}\right] \\&\le \mathbb {E}\left[ \frac{1}{N}\sum _{i=1}^{N}\left| X_{t}^{N,i} -X_{s}^{N,i}\right| ^{p}\right] \\&\le C(|t-s|^{p}+|t-s|^{\frac{p}{2}}),\\&\end{aligned} \end{aligned}$$

where we apply Jensen’s inequality, the 1-Lipschitz continuity of f, the boundedness of \(\alpha \) and b, and the Burkholder–Davis–Gundy inequality, respectively. To conclude, it suffices to choose \(p>2\).

Part 2. To prove the statement for the random flow of probability measures \(V^{N}*S_{t}^{N}\), let us first notice that, denoting by \(R>0\) a real number such that the support of V is included in \(B_{R}(0)\), the open ball of radius R centred at the origin, for all \(y\in \mathbb {R}^{d}\) we have

$$\begin{aligned} \int _{\mathbb {R}^{d}}\left| x\right| V^{N}\left( x-y\right) dx=\int _{\mathbb {R}^{d}}\left| z+y\right| V^{N}\left( z\right) dz\le \sup _{\left| w\right| \le R}\left| w+y\right| \le \left| y\right| +R \end{aligned}$$

and thus

$$\begin{aligned} \int _{\mathbb {R}^{d}}\left| x\right| \left( V^{N}*S_{t} ^{N}\right) \left( x\right) dx&=\int _{\mathbb {R}^{d}}\left| x\right| \left( \int _{\mathbb {R}^{d}}V^{N}\left( x-y\right) S_{t} ^{N}\left( dy\right) \right) dx\\&=\int _{\mathbb {R}^{d}}\left( \int _{\mathbb {R}^{d}}\left| x\right| V^{N}\left( x-y\right) dx\right) S_{t}^{N}\left( dy\right) \\&\le \int _{\mathbb {R}^{d}}\left| y\right| S_{t}^{N}\left( dy\right) +R. \end{aligned}$$

We conclude by going back to the previous estimate without the mollifier. Moreover, denoting \(V^{N,-}\left( x\right) \doteq V^{N}\left( -x\right) \), if f has Lipschitz constant less than or equal to one, then

$$\begin{aligned}&\left| \left( V^{N,-}*f\right) \left( x\right) -\left( V^{N,-}*f\right) \left( y\right) \right| \\&\quad =\left| \int _{\mathbb {R}^{d}}V^{N}\left( x^{\prime }-x\right) f\left( x^{\prime }\right) dx^{\prime }-\int _{\mathbb {R}^{d}}V^{N}\left( x^{\prime }-y\right) f\left( x^{\prime }\right) dx^{\prime }\right| \\&\quad =\left| \int _{\mathbb {R}^{d}}V^{N}\left( z\right) f\left( z+x\right) dz-\int _{\mathbb {R}^{d}}V^{N}\left( z\right) f\left( z+y\right) dz\right| \\&\quad \le \int _{\mathbb {R}^{d}}V^{N}\left( z\right) \left| f\left( z+x\right) -f\left( z+y\right) \right| dz\le \left| x-y\right| \int _{\mathbb {R}^{d}}V^{N}\left( z\right) dz=\left| x-y\right| \end{aligned}$$

namely, \(V^{N,-}*f\) also has Lipschitz constant less than or equal to one. Therefore,

$$\begin{aligned}&\left| \left\langle V^{N}*S_{t}^{N},f\right\rangle -\left\langle V^{N}*S_{s}^{N},f\right\rangle \right| \\&\quad =\left| \left\langle S_{t}^{N},V^{N,-}*f\right\rangle -\left\langle S_{s}^{N},V^{N,-}*f\right\rangle \right| \\&\quad \le \frac{1}{N}\sum _{i=1}^{N}\left| \left( V^{N,-}*f\right) ( X_{t}^{N,i} ) -\left( V^{N,-}*f\right) \left( X_{s} ^{N,i}\right) \right| \\&\quad \le \frac{1}{N}\sum _{i=1}^{N}\left| X_{t}^{N,i}-X_{s}^{N,i}\right| , \end{aligned}$$

and we are again led back to the previous estimate without the mollifier. \(\square \)

5.2 Estimates on Mollified Empirical Measures

In this subsection we obtain estimates on mollified empirical measures. More precisely, we first prove that the empirical measure \(S_t^{N}\) satisfies the following identity for a test function \(\varphi \in \text {C}_b^{1,2}([0, T] \times \mathbb {R}^{d})\):

$$\begin{aligned} \begin{aligned} \left\langle S_{t}^{N}, \varphi \left( t,\,\cdot \,\right) \right\rangle&= \left\langle S_{0}^{N}, \varphi \left( 0,\,\cdot \,\right) \right\rangle \\&\quad \, +\int _{0}^{t}\left( \left\langle S_{s}^{N}, \frac{\partial \varphi }{\partial s} \left( s,\,\cdot \,\right) +\frac{1}{2}\Delta \varphi \left( s,\,\cdot \,\right) \right\rangle +\left\langle S_{s}^{N},\alpha \left( s \right) \cdot \nabla \varphi \left( s, \cdot \right) \right\rangle \right) ds\\&\quad \,+\int _{0}^{t}\left\langle S_{s}^{N},b\left( \,\cdot \,,\left( V^{N}*S_{s}^{N}\right) \left( \,\cdot \,\right) \right) \cdot \nabla \varphi \left( s,\,\cdot \,\right) \right\rangle ds+M_{t}^{N, \varphi }, \end{aligned} \end{aligned}$$

where \(M_{t}^{N, \varphi }\) is a martingale to be defined below. Then, in Lemma 5.3 we obtain an identity in mild form for the empirical density; the latter is defined as any convolution of the empirical measure with a smooth mollifier. In our paper, we work with the following particular convolution:

$$\begin{aligned} p^{N}(t,x) \doteq \left( V^{N}*S_{t}^{N}\right) (x) = \int _{\mathbb {R} ^{d}}V^{N}(x-y)S_{t}^{N}(dy) = \frac{1}{N}\sum _{i=1}^{N}V^{N}(x-X_{t}^{N,i}), \end{aligned}$$
(5.4)

where \(t \in [0, T]\) and \(x \in \mathbb {R}^{d}\). Then, in Lemma 5.4 we derive a Hölder-type semi-norm bound for the martingale \(M_t^{N,\varphi }\) and, in Lemma 5.6, a Hölder-type semi-norm bound for the empirical density (5.4). In particular, we will see that, in order to understand the limit of \((\mathcal {L}(S^{N}))_{N \in \mathbb {N}}\), it is crucial to rigorously study the regularity properties of \(p^{N}\) that remain stable in the limit as N tends to infinity.

First, we obtain the identity for the empirical measure. Let \(\varphi \in \text {C}_b^{1,2}([0, T] \times \mathbb {R}^{d})\) be a test function. By Itô's formula,

$$\begin{aligned} \begin{aligned} d\left\langle S_{t}^{N},\varphi \left( t,\,\cdot \,\right) \right\rangle&=\frac{1}{N} \sum _{i=1}^{N}d\varphi \left( t,X_{t}^{N,i}\right) \\&=\frac{1}{N}\sum _{i=1}^{N}\frac{\partial \varphi }{\partial t} \Big ( t,X_{t}^{N,i} \Big ) dt +\frac{1}{N}\sum _{i=1}^{N}\nabla \varphi \left( t,X_{t}^{N,i}\right) \cdot \alpha (t,X_{t}^{N,i}) dt \\&\quad +\frac{1}{N}\sum _{i=1}^{N}\nabla \varphi \left( t, X_{t}^{N,i}\right) \cdot b \Big (X_{t}^{N,i},\frac{1}{N}\sum _{j=1}^{N}V^{N}(X_{t}^{N,i}-X_{t}^{N,j}) \Big )dt \\&\quad +\frac{1}{N}\sum _{i=1}^{N}\nabla \varphi \Big (t, X_{t}^{N,i} \Big ) \cdot dW_{t}^{N, i}+\frac{1}{2N}\sum _{i=1}^{N} \Delta \varphi \Big (t, X_{t}^{N,i} \Big ) dt\\&= \left\langle S_t^{N}, \frac{\partial \varphi }{\partial t}(t,\,\cdot \,) \right\rangle dt + \left\langle S_{t}^{N},\alpha ( t ) \cdot \nabla \varphi (t,\,\cdot \,) \right\rangle dt \\&\quad +\left\langle S_{t}^{N},b(\,\cdot \,, \left( V^{N}*S_{t}^{N}\right) \left( \,\cdot \,\right) ) \cdot \nabla \varphi (t,\,\cdot \,) \right\rangle dt \\&\quad +\frac{1}{N}\sum _{i=1}^{N}\nabla \varphi \left( t, X_{t}^{N,i}\right) \cdot dW_{t}^{N, i}+\frac{1}{2}\left\langle S_{t}^{N},\Delta \varphi (t,\,\cdot \,)\right\rangle dt. \end{aligned} \end{aligned}$$

In particular, the previous expression can be rewritten in integral form as:

$$\begin{aligned} \begin{aligned} \left\langle S_{t}^{N}, \varphi \left( t,\,\cdot \,\right) \right\rangle&= \left\langle S_{0}^{N}, \varphi \left( 0,\,\cdot \,\right) \right\rangle \\&\quad +\int _{0}^{t}\left( \left\langle S_{s}^{N}, \frac{\partial \varphi }{\partial s} \left( s,\,\cdot \,\right) +\frac{1}{2}\Delta \varphi \left( s,\,\cdot \,\right) \right\rangle +\left\langle S_{s}^{N},\alpha \left( s \right) \cdot \nabla \varphi \left( s, \cdot \right) \right\rangle \right) ds\\&\quad +\int _{0}^{t}\left\langle S_{s}^{N},b\left( \,\cdot \,,\left( V^{N}*S_{s}^{N}\right) \left( \,\cdot \,\right) \right) \cdot \nabla \varphi \left( s,\,\cdot \,\right) \right\rangle ds+M_{t}^{N, \varphi }, \end{aligned} \end{aligned}$$
(5.5)

where \(M_{t}^{N, \varphi }\) is the martingale

$$\begin{aligned} M_{t}^{N, \varphi }=\int _{0}^{t}\frac{1}{N}\sum _{i=1}^{N}\nabla \varphi \left( s,X_{s}^{N,i}\right) \cdot dW_{s}^{N, i}. \end{aligned}$$
(5.6)

Second, we obtain the identity in mild form for the empirical density. Henceforth, we will use the classical notational conventions of semigroup theory [see 22]. Occasionally, we will indicate the explicit dependence on the state variable to clarify the results; see, e.g., the second integral in the lemma here below.

Lemma 5.3

Let \(p^{N}\) be as in Eq. (5.4) and grant the assumptions of Theorem 5.1. Then,

$$\begin{aligned} \begin{aligned} p^{N}(t)&=\mathcal {P}_{t}p^{N}(0) +\int _{0}^{t}\nabla \mathcal {P} _{t-s}\left( V^{N}*\left( \alpha (s)\, S_{s}^{N}\right) \right) ds \\&\quad + \int _{0}^{t}\nabla \mathcal {P}_{t-s}\left( V^{N}*\left( b\left( \,\cdot \,, p^{N}(s,\,\cdot \,)\right) S_{s}^{N}\right) \right) ds+M_{t}^{N}(\,\cdot \,) \end{aligned} \end{aligned}$$

where

$$\begin{aligned} M_{t}^{N}(\,\cdot \,) = \int _{0}^{t}\frac{1}{N}\sum _{i=1}^{N}\mathcal {P}_{t-s}\nabla V^{N}\left( \,\cdot \, -X_{s}^{N,i}\right) dW_{s}^{N, i}. \end{aligned}$$
(5.7)

Proof

For the reader's convenience, let us first recall the definition of \(\mathcal {P}_{t}\); cf. Eq. (4.5). If we denote by \(G(t, x-y)\) the density of \(x + W_t\), where \(W_t\) is a standard Brownian motion, \(t \in [0, T]\) and \(x, y \in \mathbb {R}^{d}\), then \(\mathcal {P}_t\) is defined on functions \(h \in \text {C}_b(\mathbb {R}^{d})\) as

$$\begin{aligned} (\mathcal {P}_t h)(x)\doteq \int _{\mathbb {R}^{d}} G(t, x-y) h(y)\,dy. \end{aligned}$$

Now, consider for a given \(t \in [0, T]\) the identity in Eq. (5.5) with the following choice

$$\begin{aligned} \varphi ^{(t)} \left( s, x\right) =\left( \mathcal {P}_{t-s}(V^{N,-}*h)\right) \left( x\right) ,\quad s \in [0, t], \end{aligned}$$

with \(h\in \text {C}_{b}^{2}(\mathbb {R}^{d})\) and \(V^{N,-}\left( x\right) \doteq V^{N}\left( -x\right) \). Recall that \(\mathcal {P}_t\) is itself a convolution operator and convolutions commute, whence \(\mathcal {P}_t (V^{N,-}*h) = (V^{N,-}*\mathcal {P}_{t}h)\). Besides, it holds that \(\nabla \mathcal {P}_{t}(V^{N,-}*h) = (V^{N,-} *\nabla \mathcal {P}_{t}h)\). Therefore,

$$\begin{aligned} \begin{aligned} \left\langle V^{N}*S_{t}^{N},h\right\rangle =&\left\langle V^{N}*S_{0}^{N},\mathcal {P}_{t}h\right\rangle +\int _{0}^{t}\left\langle V^{N}*\left( \alpha \left( s \right) S_{s}^{N}\right) ,\nabla \mathcal {P}_{t-s}h\right\rangle ds \\&+\int _{0}^{t}\left\langle V^{N}*\left( b\left( \,\cdot \,,\left( V^{N}*S_{s}^{N}\right) \left( \,\cdot \,\right) \right) S_{s}^{N}\right) ,\nabla \mathcal {P}_{t-s}h\right\rangle ds \\&+\int _{0}^{t}\frac{1}{N}\sum _{i=1}^{N}V^{N,-}*\nabla \left( \mathcal {P} _{t-s}h\right) \left( X_{s}^{N,i}\right) \cdot dW_{s}^{N, i}. \end{aligned} \end{aligned}$$

By Fubini–Tonelli theorem and stochastic Fubini theorem, we can move the semigroup on the first argument and use integration by parts to obtain:

$$\begin{aligned} \begin{aligned} \left\langle p^{N}\left( t\right) ,h\right\rangle =&\left\langle \mathcal {P}_{t}p^{N}(0),h\right\rangle +\int _{0}^{t}\left\langle \nabla \mathcal {P}_{t-s}\left( V^{N}*\left( \alpha \left( s \right) S_{s}^{N}\right) \right) ,h\right\rangle ds \\&+\int _{0}^{t}\left\langle \nabla \mathcal {P}_{t-s}\left( V^{N}*\left( b\left( \,\cdot \,,\left( V^{N}*S_{s}^{N}\right) \left( \,\cdot \,\right) \right) S_{s}^{N}\right) \right) ,h\right\rangle ds \\&+\left\langle M_{t}^{N}(\,\cdot \,),h\right\rangle . \end{aligned} \end{aligned}$$

By the arbitrariness of h, this concludes the proof. \(\square \)

Now, denote by \(\left[ f\right] _{\gamma }\) the \(\gamma \)-Hölder semi-norm on \(\mathbb {R}^{d}\) and by \(\left\| f\right\| _{\gamma }\) the associated norm, i.e.:

$$\begin{aligned} \left[ f\right] _{\gamma }=\sup _{\begin{array}{c} x,y\in \mathbb {R}^{d} \\ x\ne y \end{array}}\frac{\left| f\left( x\right) -f\left( y\right) \right| }{ \left| x-y\right| ^{\gamma }},\qquad \left\| f\right\| _{\gamma }=\left[ f\right] _{\gamma }+\left\| f\right\| _{\infty } \end{aligned}$$
(5.8)

where, as usual, \(\left\| f\right\| _{\infty }=\sup _{x\in \mathbb {R}^{d}}\left| f\left( x\right) \right| \).
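For concreteness, the suprema in Eq. (5.8) can be approximated by restricting to a finite grid; here is a small sketch of ours in \(d=1\) (the test function and grid are illustrative):

```python
# A small sketch of ours (d = 1): approximating the Hölder semi-norm and
# norm of Eq. (5.8) by restricting the suprema to a finite grid.
def holder_norms(f, grid, gamma):
    semi = max(
        abs(f(x) - f(y)) / abs(x - y) ** gamma
        for x in grid for y in grid if x != y
    )
    sup = max(abs(f(x)) for x in grid)
    return semi, semi + sup          # [f]_gamma and ||f||_gamma

# For f(x) = |x|^{1/2} and gamma = 1/2: [f]_{1/2} = 1 (attained at y = 0),
# and the sup norm on [-2, 2] equals sqrt(2).
grid = [k / 100.0 for k in range(-200, 201)]
semi, norm = holder_norms(lambda x: abs(x) ** 0.5, grid, 0.5)
```

Since the pair \((x,0)\) realizes the semi-norm of \(|x|^{1/2}\), the grid approximation recovers the exact value 1 here.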

Lemma 5.4

Let \(M_t^{N}(\,\cdot \,)\) be the martingale in Eq. (5.7) and let \(\beta \in (0, 1/2)\) be the constant appearing in the definition of \(V^{N}\); see Eq. (2.1). Then, there exists \(\gamma \in \left( 0,1\right) \) such that, for all \(p\ge 2\), there is a constant \(C_{p}>0\) such that \(\mathbb {E}\left[ \left\| M_{t}^{N}\right\| _{\gamma }^{p}\right] \le C_{p}\), for all \(N\in \mathbb {N}\) and \(t\in \left[ 0,T\right] \).

Proof

It is enough to check the sufficient conditions (C.3)–(C.4) of Lemma C.2 in Appendix 1.

Let \(\epsilon _N^{-1} = N^{\frac{\beta }{d}}\). Using Eq. (C.6), the bound in Eq. (C.3) reads

$$\begin{aligned} \begin{aligned} \mathbb {E}\left[ \left| M_{t}^{N}\left( x\right) \right| ^{p}\right]&= \frac{1}{N^{p}}\mathbb {E}\left[ \left| \sum _{i=1}^{N}\int _{0}^{t}\nabla \mathcal {P}_{t-s}V^{N}\left( x-X_{s}^{N,i}\right) dW_{s}^{N, i}\right| ^{p}\right] \\&\le \frac{C_{p}}{N^{p}}\mathbb {E}\left[ \left| \sum _{i=1}^{N}\int _{0}^{t}\left| \nabla \mathcal {P}_{t-s}V^{N}\left( x-X_{s}^{N,i}\right) \right| ^{2}ds\right| ^{p/2}\right] \\&\le \frac{C_{p}C_{T,R,V}^{p}\epsilon _{N}^{-pd-p\delta }}{N^{p}}\mathbb {E} \left[ \left| \sum _{i=1}^{N}\int _{0}^{t}\frac{1}{\left( t-s\right) ^{1-\delta }} e^{-\frac{\left| x-X_{s}^{N,i}\right| }{4T}}\,ds\right| ^{p/2}\right] \\&\le \frac{\widetilde{C}_{p,T,R,V}\epsilon _{N}^{-pd-p\delta }}{N^{p}}\,e^{-\frac{\left| x\right| }{8T}} \mathbb {E}\left[ e^{\frac{|| X^{N, i} ||_{\infty , T}}{4 T}} \left| \sum _{i=1}^{N}\int _{0}^{t}\frac{1}{\left( t-s\right) ^{1-\delta }}ds\right| ^{p/2}\right] \\&\le \frac{C_{p,T,R,V,\delta }^{\prime }\epsilon _{N}^{-pd-p\delta }}{ N^{p/2}}\,{e^{-\frac{|x|}{8 T}}}\mathbb {E} \left[ e^{p\,\frac{|| X^{N, 1} ||_{\infty , T}}{8 T}} \right] \end{aligned} \end{aligned}$$

where, to ease notation, we set \(|| X^{N, i} ||_{\infty , T}\doteq \sup _{s \in [0,T]}| X_s^{N,i} |,\,\,i\in [[N]]\). The last expected value is finite thanks to \((\text {H4})\); therefore,

$$\begin{aligned} \mathbb {E}\left[ \left| M_{t}^{N}\left( x\right) \right| ^{p}\right] \le C_{p,T,R,V,\delta }^{\prime \prime }\frac{\epsilon _{N}^{-pd-p\delta }}{ N^{p/2}}g^{p}\left( x\right) , \end{aligned}$$

where (up to a constant) \(g\left( x\right) \doteq e^{-\frac{\left| x\right| }{8T}}\) is integrable at any power. Now, recall that \(\epsilon _{N}^{-1}=N^{\frac{\beta }{d}}\). Then

$$\begin{aligned} \frac{\epsilon _{N}^{-pd-p\delta }}{N^{p/2}}=\frac{N^{\frac{\beta }{d}\left( pd+p\delta \right) }}{N^{p/2}}=N^{-\left( \frac{1}{2}-\beta \right) p+\frac{\beta p\delta }{d}}, \end{aligned}$$

which is bounded for \(\beta <\frac{1}{2}\) by choosing \(\delta \) (depending on p) small enough.

As regards the bound in Eq. (C.4), we use estimate (C.7) with \( \gamma \) small enough compared to \(\delta \) so as to have \(( \gamma -\delta \left( 1-\gamma \right) ) <0\). To ease notation and for the sake of space, we denote

$$\begin{aligned} \begin{aligned}&\Delta _h M_t^{N}(x) \doteq M_{t}^{N}\left( x\right) -M_{t}^{N}\left( x+h\right) \\&\Delta _h \mathcal {P}_{t-s} V^{N} (x - X_s^{N,i}) \doteq \mathcal {P}_{t-s}V^{N} (x - X_s^{N,i}) - \mathcal {P}_{t-s}V^{N} (x + h - X_s^{N,i}) \end{aligned} \end{aligned}$$

and, as before, \(|| X^{N, i} ||_{\infty , T}\doteq \sup _{s \in [0,T]}| X_s^{N,i} |,\,\,i\in [[N]]\). We get

$$\begin{aligned} \begin{aligned}&\mathbb {E}\left[ \left| \Delta _h M_t^{N}(x) \right| ^{p}\right] \\&\quad = \frac{1}{N^{p}}\mathbb {E}\left[ \left| \sum _{i=1}^{N}\int _{0}^{t} \Delta _h \mathcal {P}_{t-s} V^{N} (x - X_s^{N,i}) dW_{s}^{N, i}\right| ^{p}\right] \\&\quad \le \frac{C_{p}}{N^{p}}\mathbb {E}\left[ \left| \sum _{i=1}^{N}\int _{0}^{t}\left| \Delta _h \mathcal {P}_{t-s} V^{N} (x - X_s^{N,i}) \right| ^{2}ds\right| ^{p/2}\right] \\&\quad \le \frac{C_{p}}{N^{p}}\mathbb {E}\left[ \left| \sum _{i=1}^{N}\int _{0}^{t}\frac{C_{T,R,V}^{2}}{\left( t-s\right) ^{1+\bar{\gamma }}}\left| h\right| ^{2\gamma }\epsilon _{N}^{-2d-2\delta \left( 1-\gamma \right) } e^{-2\lambda _{T,R,V}\left| x-X_{s}^{N,i}\right| }\, ds\right| ^{p/2} \right] \\&\quad \le \frac{\widetilde{C}_{p,T,R,V}\epsilon _{N}^{-pd-p\delta \left( 1-\gamma \right) }}{N^{p/2}}\left| h\right| ^{p\gamma }e^{-2\lambda _{T,R,V}\left| x \right| } \mathbb {E}\left[ e^{p\,\lambda _{T,R,V} \Vert X^{N,1}\Vert _{\infty , T}} \right] \end{aligned} \end{aligned}$$

and the conclusion is the same as for the previous term. \(\square \)
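The rate bookkeeping in the proof above can be double-checked in exact rational arithmetic; here is a sketch of ours (the sample values of \(\beta \), p, \(\delta \), d are arbitrary):

```python
# Exact-arithmetic check (ours) of the rate bookkeeping in the proof:
# with eps_N^{-1} = N^(beta/d), the prefactor eps_N^(-pd - p*delta) / N^(p/2)
# equals N^e with e = -(1/2 - beta)*p + beta*p*delta/d, which is
# negative for beta < 1/2 once delta is small enough.
from fractions import Fraction as F

def log_N_prefactor(beta, p, delta, d):
    """log_N of eps_N^(-pd - p*delta) / N^(p/2)."""
    return beta / F(d) * (p * d + p * delta) - F(p, 2)

beta, p, delta, d = F(2, 5), 4, F(1, 10), 3
e = log_N_prefactor(beta, p, delta, d)
closed_form = -(F(1, 2) - beta) * p + beta * p * delta / F(d)
```

Since the exponent is proportional to p, the smallness condition on \(\delta \) is the same sign condition for every p.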

Remark 5.5

Lemma 5.4 is a non-trivial achievement of this paper. Indeed, the Kolmogorov–Chentsov criterion (see Karatzas and Shreve 1998, Theorem 2.2.8) would provide, with many fewer computations, a similar result on bounded sets. However, the dominating constant would diverge when passing to the full space, and we will need the full-space bound in Lemma 5.6 below. For this reason, we use a more complicated strategy, summarized by the results in Appendix 1, based on the Sobolev embedding theorem.

Lemma 5.6

Let \(p^{N}(t)\) be as in Lemma 5.3. If \(\beta \in \left( 0, 1/2 \right) \) and \(\sup _{N}\left\| p^{N}(0)\right\| _{\gamma }^{2}<\infty \), then there exist \(p\ge 2\), \(\gamma \in \left( 0,1\right) \) and a constant \(C>0\) such that \(\mathbb {E}\left[ \left\| p^{N}(t)\right\| _{\gamma }^{p}\right] \le C\), for all \(N\in \mathbb {N}\) and \(t\in \left[ 0,T\right] \).

Proof

Lemma 5.3 provides the following bound

$$\begin{aligned} \begin{aligned} \mathbb {E}\left[ \left\| p^{N}(t)\right\| _{\gamma }^{p}\right] ^{1/p}&\le \mathbb {E}\left[ \left\| \mathcal {P}_{t}p^{N}(0)\right\| _{\gamma }^{p}\right] ^{1/p}\\&\quad +\int _{0}^{t}\mathbb {E}\left[ \left\| \nabla \mathcal {P}_{t-s}\left( V^{N}*\left( \alpha (s)S_{s}^{N}\right) \right) \right\| _{\gamma }^{p}\right] ^{1/p}ds\\&\quad +\int _{0}^{t}\mathbb {E}\left[ \left\| \nabla \mathcal {P}_{t-s}\left( V^{N}*\left( b\left( \,\cdot \,,p^{N}\left( s, \,\cdot \,\right) \right) S_{s}^{N}\right) \right) \right\| _{\gamma }^{p}\right] ^{1/p}ds\\&\quad +\mathbb {E }\left[ \left\| M_{t}^{N}\right\| _{\gamma }^{p}\right] ^{1/p}, \end{aligned} \end{aligned}$$

where we use the triangle inequality. The first inequality of Lemma C.3 in Appendix 1 controls the semigroup terms, while Lemma 5.4 bounds the martingale term. Therefore,

$$\begin{aligned} \begin{aligned} \mathbb {E}\left[ \left\| p^{N}(t)\right\| _{\gamma }^{p}\right] ^{1/p}&\le C+\int _{0}^{t}\frac{C}{\left( t-s\right) ^{\frac{1+\gamma }{2}} }\mathbb {E}\left[ \left\| V^{N}*\left( \alpha ( s,\,\cdot \, )S_{s}^{N}\right) \right\| _{\infty }^{p}\right] ^{1/p}ds\\&\quad +\int _{0}^{t}\frac{C}{\left( t-s\right) ^{\frac{1+\gamma }{2}}}\mathbb {E} \left[ \left\| V^{N}*\left( b\left( \,\cdot \,,p^{N}\left( s, \,\cdot \,\right) \right) S_{s}^{N}\right) \right\| _{\infty }^{p}\right] ^{1/p}ds\\&\quad +C. \end{aligned} \end{aligned}$$

At this point, we need to find a bound for the last two expected values. We start from the first.

$$\begin{aligned} \begin{aligned}&\left| \left( V^{N}*\left( \alpha (s,\,\cdot \,)S_{s}^{N}\right) \right) \left( x\right) \right| \le \int _{\mathbb {R}^{d}}V^{N}\left( x-y\right) \left| \alpha (s, y) \right| S_{s}^{N}\left( dy\right) \\&\quad \le \left\| \alpha (s,\,\cdot \,) \right\| _{\infty }\int _{\mathbb {R}^{d}}V^{N}\left( x-y\right) S_{s}^{N}\left( dy\right) =\left\| \alpha (s,\,\cdot \,) \right\| _{\infty } p^{N}(s,x) \end{aligned} \end{aligned}$$

hence

$$\begin{aligned} \mathbb {E}\left[ \left\| V^{N}*\left( \alpha (s,\,\cdot \,)S_{s}^{N}\right) \right\| _{\infty }^{p}\right] ^{1/p}\le & {} \left\| \alpha (s,\,\cdot \,) \right\| _{\infty }\mathbb {E}\left[ \left\| p^{N}(s)\right\| _{\infty }^{p}\right] ^{1/p}\\\le & {} \left\| \alpha (s,\,\cdot \,) \right\| _{\infty }\mathbb {E}\left[ \left\| p^{N}(s)\right\| _{\gamma }^{p}\right] ^{1/p} \end{aligned}$$

As regards the second expected value, we similarly obtain

$$\begin{aligned} \mathbb {E}\left[ \left\| V^{N}*\left( b\left( \,\cdot \,, p^{N}(s, \, \cdot \,) \right) S_{s}^{N}\right) \right\| _{\infty }^{p}\right] ^{1/p}\le \left\| b\right\| _{\infty }\mathbb {E}\left[ \left\| p^{N}(s)\right\| _{\gamma }^{p}\right] ^{1/p}. \end{aligned}$$

Therefore,

$$\begin{aligned} \mathbb {E}\left[ \left\| p^{N}(t)\right\| _{\gamma }^{p}\right] ^{1/p}\le C+C\int _{0}^{t}\frac{\left\| \alpha (s,\,\cdot \,) \right\| _{\infty }+\left\| b\left( \,\cdot \,, p^{N}(s, \, \cdot \,) \right) \right\| _{\infty }}{\left( t-s\right) ^{\frac{1+\gamma }{2} }}\mathbb {E}\left[ \left\| p^{N}(s)\right\| _{\gamma }^{p}\right] ^{1/p}ds. \end{aligned}$$

The conclusion follows by a generalized version of Gronwall’s lemma. \(\square \)
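To see concretely why the singular kernel \((t-s)^{-(1+\gamma )/2}\) is harmless, one can iterate the integral inequality numerically; the following sketch (ours, with illustrative constants) runs the Picard iteration for \(u(t)=c+c\int _{0}^{t}(t-s)^{-a}u(s)\,ds\) with \(a=3/4<1\):

```python
# A numerical sketch (ours) of why the generalized Gronwall lemma applies:
# Picard iteration of u(t) = c + c * int_0^t (t-s)^(-a) u(s) ds with an
# integrable singular kernel, a = (1+gamma)/2 < 1, stays bounded on [0, T].
c, a, T, n = 0.1, 0.75, 1.0, 100
h = T / n
t = [j * h for j in range(n + 1)]

def picard_step(u):
    """One iteration; the cell integrals of (t_j - s)^(-a) are computed exactly
    for the piecewise-constant interpolant of u."""
    new = [c]
    for j in range(1, n + 1):
        integral = sum(
            u[i]
            * ((t[j] - t[i]) ** (1 - a) - (t[j] - t[i + 1]) ** (1 - a))
            / (1 - a)
            for i in range(j)
        )
        new.append(c + c * integral)
    return new

u = [c] * (n + 1)
for _ in range(25):
    prev, u = u, picard_step(u)
gap = max(abs(x - y) for x, y in zip(u, prev))  # iterates converge
```

The iterates converge to a bounded fixed point on [0, T], which is the qualitative content of the generalized Gronwall lemma invoked above.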

We are now ready to prove Theorem 5.1; its proof is the content of the next subsection.

5.3 Identification of the Limit

Let us denote by \(P_{N}\) and \(Q_{N}\) the laws of \(S^{N}\) and \(V^{N}*S^{N}\), respectively, on \(\text {C}([0,T];\mathcal {P}(\mathbb {R}^{d}))\), for each \(N\in \mathbb {N}\). By Lemma 5.2, we know that both the families \((P_{N})_{N\in \mathbb {N}}\) and \((Q_{N})_{N\in \mathbb {N}}\) are tight in \(\text {C}([0,T];\mathcal {P}(\mathbb {R}^{d}))\). Moreover, their convergent subsequences have the same limit, in the following strong sense.

Lemma 5.7

Assume a subsequence \((P_{N_{k}})_{k\in \mathbb {N}}\) converges weakly to a probability measure P on \(\text {C}([0,T];\mathcal {P}(\mathbb {R}^{d}))\). Then also \((Q_{N_{k}})_{k\in \mathbb {N}}\) converges weakly to P.

Proof

To prove the lemma, we are going to show that every convergent subsequence of \((Q_{N_{k}})_{k\in \mathbb {N}}\) has limit P; indeed, this implies that \((Q_{N_{k}})_{k\in \mathbb {N}}\) converges to P. To this end, let \((Q_{N_{k}^{\prime }})_{k\in \mathbb {N}}\) be a subsequence of \((Q_{N_{k}})_{k\in \mathbb {N}}\) converging to a probability measure Q on \(\text {C}([0,T];\mathcal {P}(\mathbb {R}^{d}))\). In particular, for every positive integer m and every finite sequence \(t_{1}<...<t_{m}\in \left[ 0,T\right] \), both \(\pi _{\left( t_{1},...,t_{m}\right) }P_{N_{k}^{\prime }}\) and \(\pi _{\left( t_{1},...,t_{m}\right) }Q_{N_{k}^{\prime }}\) converge weakly on \(\mathcal {P}(\mathbb {R}^{d})^{m}\), where \(\pi _{\left( t_{1},...,t_{m}\right) }\) is the projection onto the finite-dimensional marginals at times \(\left( t_{1} ,...,t_{m}\right) \). The limits are, respectively, \(\pi _{\left( t_{1},...,t_{m}\right) }P\) and \(\pi _{\left( t_{1},...,t_{m}\right) }Q\). If we prove that they are equal, then \(P=Q\) as a consequence of the Kolmogorov extension theorem (see e.g. Stroock and Varadhan, 2007, Theorem 1.1.10).

Now, by Skorokhod representation theorem, on a new probability space \(\left( \widetilde{\Omega },\widetilde{\mathcal {F}},\widetilde{\mathbb {P}}\right) \) we may consider a sequence \(\widetilde{S}_{t}^{N_{k}^{\prime }}\) of continuous processes with values in \(\mathcal {P}(\mathbb {R}^{d})\) and a continuous process \(\widetilde{\mu }_{t}\) with values in \(\mathcal {P}(\mathbb {R}^{d})\) such that their laws on \(\text {C}([0,T];\mathcal {P}(\mathbb {R}^{d}))\) are \(P_{N_{k}^{\prime }}\) and P respectively; and \(V^{N_{k}^{\prime }}*\widetilde{S}_{\cdot }^{N_{k}^{\prime }}\) has law \(Q_{N_{k}^{\prime }}\), which we know to be convergent, weakly, to Q. As remarked at the beginning of Appendix 1, given \(t\in \left[ 0,T\right] \), with probability one, \(\left\langle V^{N_{k}^{\prime }}*\widetilde{S} _{t}^{N_{k}^{\prime }},\varphi \right\rangle \) converges to \(\left\langle \widetilde{\mu }_{t},\varphi \right\rangle \) for all \(\varphi \in C_{c}\left( \mathbb {R}^{d}\right) \), and therefore for all \(\varphi \in \text {C}_{b}\left( \mathbb {R}^{d}\right) \) because \(\widetilde{\mu }_{t}\in \mathcal {P}\left( \mathbb {R}^{d}\right) \). Therefore, with \(\widetilde{\mathbb {P}}\)-probability one, \(V^{N_{k}^{\prime }}*\widetilde{S}_{t}^{N_{k}^{\prime }}\) converges to \(\widetilde{\mu }_{t}\) in the topology of \(\mathcal {P}(\mathbb {R}^{d})\). Hence, also the law of \(V^{N_{k}^{\prime }}*\widetilde{S}_{t}^{N_{k}^{\prime }}\) converges weakly to the law of \(\widetilde{\mu }_{t}\) in the topology of \(\mathcal {P}(\mathbb {R}^{d})\); namely \(\pi _{t}Q_{N_{k}^{\prime }}\) converges weakly to \(\pi _{t}P\). Similarly, if \(t_{1}<...<t_{m}\in \left[ 0,T\right] \), the \(\mathcal {P} (\mathbb {R}^{d})^{m}\)-valued random variable \(\left( V^{N_{k}^{\prime }} *\widetilde{S}_{t_{1}}^{N_{k}^{\prime }},...,V^{N_{k}^{\prime }} *\widetilde{S}_{t_{m}}^{N_{k}^{\prime }}\right) \) converges a.s. 
to \(\left( \widetilde{\mu }_{t_{1}},...,\widetilde{\mu }_{t_{m}}\right) \) in the topology of \(\mathcal {P}(\mathbb {R}^{d})^{m}\). Therefore, also the law of \(\left( V^{N_{k}^{\prime }}*\widetilde{S}_{t_{1}}^{N_{k}^{\prime } },...,V^{N_{k}^{\prime }}*\widetilde{S}_{t_{m}}^{N_{k}^{\prime }}\right) \) converges weakly to the law of \(\left( \widetilde{\mu }_{t_{1}} ,...,\widetilde{\mu }_{t_{m}}\right) \) in the topology of \(\mathcal {P} (\mathbb {R}^{d})^{m}\), which means that \(\pi _{\left( t_{1},...,t_{m}\right) }Q_{N_{k}^{\prime }}\) converges weakly to \(\pi _{\left( t_{1},...,t_{m}\right) }P\). \(\square \)
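The weak-convergence mechanism used above can be illustrated with a toy computation (ours, unrelated to the specific processes of the paper): the empirical measure of N i.i.d. samples approaches the underlying law, which we monitor through the Kolmogorov distance between the empirical and true CDFs:

```python
# A toy illustration (ours) of weak convergence of empirical measures:
# for N i.i.d. Uniform(0,1) samples, the Kolmogorov distance
# sup_x |F_N(x) - x| between the empirical CDF F_N and the true CDF
# shrinks as N grows.
import random

def kolmogorov_distance(samples):
    xs = sorted(samples)
    n = len(xs)
    # The supremum is attained at the jump points of the empirical CDF.
    return max(max(abs((k + 1) / n - x), abs(k / n - x))
               for k, x in enumerate(xs))

random.seed(1)
d_100 = kolmogorov_distance([random.random() for _ in range(100)])
d_10000 = kolmogorov_distance([random.random() for _ in range(10000)])
```

The distance decays like \(N^{-1/2}\) by the classical Kolmogorov–Smirnov asymptotics; the convergence in the lemma is of the same qualitative nature, upgraded to the path space \(\text {C}([0,T];\mathcal {P}(\mathbb {R}^{d}))\).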

Now, let \((P_{N_{k}})_{k\in \mathbb {N}}\) be a convergent subsequence of \((P_{N})_{N\in \mathbb {N}}\) (which exists thanks to Lemma 5.2) with limit P on \(\text {C}([0,T];\mathcal {P}(\mathbb {R}^{d}))\). We shall prove the following two statements.

(i):

The probability measure P is equal to \(\delta _{\mu }\) for a suitable \(\mu \in \text {C}([0,T];\mathcal {P} (\mathbb {R}^{d}))\) which does not depend on the subsequence \(\left( N_{k}\right) _{k\in \mathbb {N}}\); hence the full sequence \((P_{N} )_{N\in \mathbb {N}}\) will converge weakly to \(\delta _{\mu }\) and \(S^{N}\) will converge in probability to \(\mu \).

(ii):

\(\mu \) satisfies the conditions in Theorem 5.1.

To this end, to simplify notation, we shall prove that the original sequence \((P_{N})_{N\in \mathbb {N}}\) admits a subsequence \((P_{N_{k}})_{k\in \mathbb {N}}\) which converges weakly to \(\delta _{\mu }\) for a unique \(\mu \in C([0,T];\mathcal {P}(\mathbb {R}^{d}))\) satisfying all the conditions of Theorem 5.1. The same argument, applied to any subsequence \((P_{N_{k}})_{k\in \mathbb {N}}\) in place of the original \((P_{N})_{N\in \mathbb {N}}\), proves the claim above; this is the content of Proposition 5.8.

Denote by \(\Lambda \subset \text {C}([0,T];\mathcal {P}(\mathbb {R} ^{d}))\) the set of all \(\left( \mu _{t}\right) _{t\in \left[ 0,T\right] }\) such that there exists \(p:[0,T]\times \mathbb {R}^{d}\rightarrow \mathbb {R}\) with the property that \(x\mapsto p\left( t,x\right) \) is continuous, bounded, non-negative, \(\int _{\mathbb {R}^{d}}p\left( t,x\right) dx=1\) and \(\mu _{t}\left( dx\right) =p\left( t,x\right) dx\) for all \(t\in \left[ 0,T\right] \). Since

$$\begin{aligned} t\mapsto \int _{\mathbb {R}^{d}}p\left( t,x\right) \varphi \left( x\right) dx=\left\langle \mu _{t},\varphi \right\rangle \end{aligned}$$

is continuous for every \(\varphi \in \text {C}_{b}\left( \mathbb {R}^{d}\right) \), the function p is measurable in \(\left( t,x\right) \) and weakly continuous in t, in the sense just described.

Given \(\alpha \in \text {C}_b([0,T] \times \mathbb {R}^{d} ; \mathbb {R}^{d})\), \(\varphi \in \text {C}_{c}^{1,2}\left( [0,T]\times \mathbb {R}^{d}\right) \) and \(\mu \in \Lambda \), set

$$\begin{aligned} \Phi _{\varphi }\left( \mu \right)= & {} \sup _{t\in \left[ 0,T\right] }\left| \left\langle \mu _{t},\varphi (t,\,\cdot \,)\right\rangle -\left\langle \mu _{0} ,\varphi (0,\cdot )\right\rangle -\int _{0}^{t}\left\langle \mu _{s}, \mathcal {A}\varphi (s,\,\cdot \,) \right. \right. \nonumber \\&\left. \left. +\left( \alpha (s)+b\left( p(s)\right) \right) \cdot \nabla \varphi (s,\,\cdot \,)\right\rangle ds\right| \end{aligned}$$
(5.9)

where, for the sake of space, \(b\left( p(s)\right) \) denotes the function \(b\left( \,\cdot \,,p\left( s,\,\cdot \,\right) \right) \) and \(p\left( s,\,\cdot \,\right) \) is the density of \(\mu _{s}\). Moreover, we recall that \(\mathcal {A}\) is the operator defined in Eq. (4.2).

Proposition 5.8

Let \(\left( N_{k}\right) \) be a subsequence such that \(P_{N_{k}}\) converges in law to P on \(\text {C}([0,T];\mathcal {P}(\mathbb {R}^{d}))\). Then:

(i):

\(P\left( \Lambda \right) =1\).

(ii):

\(\int \left( \Phi _{\varphi }\left( \mu \right) \wedge 1\right) P\left( d\mu \right) =0\) for every \(\varphi \in \text {C}_{c}^{1,2}\left( [0,T]\times \mathbb {R}^{d}\right) \).

Proof

The proof is divided into four steps. Before proceeding, notice that, by Lemma 5.7, also \((Q_{N_{k}})_{k\in \mathbb {N}}\) converges weakly to P.

Step 1   On an auxiliary probability space, let \((\mu _{t})_{0 \le t \le T}\) be a process with law P. Given \(t\in \left[ 0,T\right] \), \(S_{t}^{N_{k}}\) converges in law to \(\mu _{t}\). Moreover, \(V^{N_{k}}*S_{t}^{N_{k}}\) satisfies the assumptions of Lemma D.2 of Appendix 1. Therefore \(P\left( \Lambda \right) =1\).

Step 2   For every \(\delta \in (0,1)\) and \(\mu \in \mathcal {P}(\mathbb {R}^{d})\), let \(\mathcal {P}_{\delta }\mu \) denote the following function:

$$\begin{aligned} \left( \mathcal {P}_{\delta }\mu \right) \left( x\right) =\int _{\mathbb {R} ^{d}}G\left( \delta ,x-y\right) \mu \left( dy\right) . \end{aligned}$$

Moreover, introduce for \(\varphi \in \text {C}_{c}^{1,2}\left( [0,T]\times \mathbb {R}^{d}\right) \) and \(\delta \in \left( 0,1\right) \), the regularized functional, defined on \(\mu \in C([0,T];\mathcal {P}(\mathbb {R}^{d}))\) (instead of \(\Lambda \))

$$\begin{aligned} \Phi _{\varphi ,\delta }\left( \mu \right)= & {} \sup _{t\in \left[ 0,T\right] }\left| \left\langle \mu _{t},\varphi (t,\,\cdot \,)\right\rangle -\left\langle \mu _{0},\varphi (0,\,\cdot \,)\right\rangle -\int _{0}^{t}\left\langle \mu _{s} , \mathcal {A}\varphi (s,\,\cdot \,)+\left( \alpha (s)\right. \right. \right. \\&\left. \left. \left. +b\left( \mathcal {P}_{\delta }\mu _{s}\right) \right) \cdot \nabla \varphi (s,\,\cdot \,)\right\rangle ds\right| . \end{aligned}$$

It is easy to check that the previous functional is continuous on \(\text {C} ([0,T];\mathcal {P}(\mathbb {R}^{d}))\). Therefore, since \(\Phi _{\varphi ,\delta }\left( \cdot \right) \wedge 1\) is continuous and bounded,

$$\begin{aligned} \lim _{k\rightarrow \infty }\int \left( \Phi _{\varphi ,\delta }\left( \mu \right) \wedge 1\right) Q_{N_{k}}\left( d\mu \right) =\int \left( \Phi _{\varphi ,\delta }\left( \mu \right) \wedge 1\right) P\left( d\mu \right) . \end{aligned}$$

Recall that \(P\left( \Lambda \right) =1\). For each \(\mu \in \Lambda \) and \(s\in \left[ 0,T\right] \), it holds that:

$$\begin{aligned} \lim _{\delta \rightarrow 0}\mathcal {P}_{\delta }\mu _{s}=p(s), \end{aligned}$$

locally in the uniform topology, where p(s) is the density of \(\mu _{s}\); therefore,

$$\begin{aligned} \lim _{\delta \rightarrow 0}b\left( \mathcal {P}_{\delta }\mu _{s}\right) =b\left( p(s)\right) \end{aligned}$$

locally in the uniform topology, and it is a bounded convergence. Hence, thanks to the local cut-off given by \(\varphi (s)\) we have:

$$\begin{aligned} \begin{aligned} \left\langle \mu _{s},b\left( \mathcal {P}_{\delta }\mu _{s}\right) \cdot \nabla \varphi (s ,\,\cdot \,)\right\rangle&=\left\langle p(s),b\left( \mathcal {P} _{\delta }\mu _{s}\right) \cdot \nabla \varphi (s,\,\cdot \,)\right\rangle \\&\overset{\delta \rightarrow 0}{\rightarrow }\left\langle p(s),b\left( p(s)\right) \cdot \nabla \varphi (s,\,\cdot \,)\right\rangle \\&=\left\langle \mu _{s},b\left( p(s)\right) \cdot \nabla \varphi (s,\,\cdot \,)\right\rangle . \end{aligned} \end{aligned}$$

By Lebesgue dominated convergence we conclude that

$$\begin{aligned} \lim _{\delta \rightarrow 0}\Phi _{\varphi ,\delta }\left( \mu \right) =\Phi _{\varphi }\left( \mu \right) \end{aligned}$$

and thus again, by the same theorem,

$$\begin{aligned} \lim _{\delta \rightarrow 0}\int \left( \Phi _{\varphi ,\delta }\left( \mu \right) \wedge 1\right) P\left( d\mu \right) =\int \left( \Phi _{\varphi }\left( \mu \right) \wedge 1\right) P\left( d\mu \right) . \end{aligned}$$

Therefore

$$\begin{aligned} \int \left( \Phi _{\varphi }\left( \mu \right) \wedge 1\right) P\left( d\mu \right) =\lim _{\delta \rightarrow 0}\lim _{k\rightarrow \infty }\int \left( \Phi _{\varphi ,\delta }\left( \mu \right) \wedge 1\right) Q_{N_{k}}\left( d\mu \right) . \end{aligned}$$

In the next step, we prove that this double limit, taken in the specified order, is zero.

Step 3   We have the following identity:

$$\begin{aligned} \int \left( \Phi _{\varphi ,\delta }\left( \mu \right) \wedge 1\right) Q_{N_{k} }\left( d\mu \right) =\mathbb {E}\left[ \Phi _{\varphi ,\delta }\left( V^{N_{k}}*S^{N_{k}}\right) \wedge 1\right] . \end{aligned}$$
(5.10)

Choosing \(V^{N_{k},-}*\varphi \) as test function in Eq. (5.5),

$$\begin{aligned} \begin{aligned}&\left\langle S_{t}^{N_{k}},(V^{N_{k},-}*\varphi )\left( t, \,\cdot \,\right) \right\rangle -\left\langle S_{0}^{N_{k}},(V^{N_{k},-}*\varphi )\left( 0,\,\cdot \,\right) \right\rangle \\&\qquad -\int _{0}^{t}\left\langle S_{s}^{N_{k}}, \mathcal {A}(V^{N_{k},-}*\varphi )(s,\,\cdot \,) \right\rangle ds\\&\quad =\int _{0}^{t}\left\langle S_{s}^{N_{k}},\left( \alpha (s)+b\left( V^{N_{k}}*S_{s}^{N_{k}}\right) \right) \cdot (\nabla V^{N_{k},-} *\varphi )(s,\,\cdot \,)\right\rangle ds+M_{t}^{N_{k},V^{N_{k},-}*\varphi }, \end{aligned} \end{aligned}$$

where \(M_t^{N_k, V^{N_k,-} *\varphi }\) denotes the martingale (5.6) in which N and \(\varphi \) have been replaced by \(N_k\) and \(V^{N_k,-} *\varphi \), respectively. Thus,

$$\begin{aligned} \begin{aligned}&\left\langle V^{N_{k}}*S_{t}^{N_{k}}, \varphi \left( t,\,\cdot \,\right) \right\rangle -\left\langle V^{N_{k}}*S_{0}^{N_{k}},\varphi \left( 0,\,\cdot \,\right) \right\rangle -\int _{0}^{t}\left\langle V^{N_{k}}*S_{s}^{N_{k} },\mathcal {A}\varphi (s,\,\cdot \,)\right\rangle ds\\&\quad =\int _{0}^{t}\left\langle V^{N_{k}}*\left[ \left( \alpha (s)+b\left( V^{N_{k}}*S_{s}^{N_{k}}\right) \right) S_{s}^{N_{k}}\right] ,\nabla \varphi (s,\,\cdot \,)\right\rangle ds+M_{t}^{N_{k},V^{N_{k},-}*\varphi }. \end{aligned} \end{aligned}$$
(5.11)

For the sake of space, we set for \(t\in [0,T]\):

$$\begin{aligned} \begin{aligned}&V_{t}^{k,*}\doteq V^{N_{k}}*S_{t}^{N_{k}}\quad \quad \quad \,\, V_{t}^{k,\alpha ,*}\doteq V^{N_{k}}*(\alpha (t)S_{t}^{N_{k}})\\&V_{t}^{k,\alpha ,b} \doteq \alpha (t)+b(V_{t}^{k,*})\quad V_{t}^{k,\alpha ,b,\delta }\doteq \alpha (t)+b(\mathcal {P}_{\delta }(V_{t}^{k,*})). \end{aligned} \end{aligned}$$

Now, we estimate the expected value on the right-hand side of Eq. (5.10).

$$\begin{aligned} \begin{aligned}&\mathbb {E}\left[ \Phi _{\varphi ,\delta }(V_{\cdot }^{k,*})\wedge 1\right] \\&\quad \le \mathbb {E}\left[ \sup _{t\in \left[ 0,T\right] }\left| \left\langle V_{t}^{k,*},\varphi (t,\,\cdot \,)\right\rangle -\left\langle V_{0}^{k,*},\varphi (0,\,\cdot \,)\right\rangle \right. \right. \\&\left. \left. \quad -\int _{0}^{t}\left\langle V_{s}^{k,*},\mathcal {A}\varphi (s,\,\cdot \,) +V_{s}^{k,\alpha ,b,\delta }\cdot \nabla \varphi (s,\,\cdot \,)\right\rangle ds\right| \right] \\&\quad =\mathbb {E}\left[ \sup _{t\in \left[ 0,T\right] }\left| \int _{0} ^{t}\left\langle V^{N_{k}}*\left[ V_{s}^{k,\alpha ,b}\,S_{s}^{N_{k} }\right] ,\nabla \varphi (s,\,\cdot \,)\right\rangle ds+M_{t}^{N_{k},V^{N_{k},-}*\varphi }\right. \right. \\&\qquad \left. \left. -\int _{0}^{t}\left\langle V_{s}^{k,*},V_{s}^{k,\alpha ,b,\delta }\cdot \nabla \varphi (s,\,\cdot \,)\right\rangle ds\right| \right] \\&\quad \le \mathbb {E}\left[ \left| M_{T}^{N_{k},V^{N_{k},-}*\varphi }\right| ^{2}\right] ^{1/2}\\&\qquad +\mathbb {E}\left[ \int _{0}^{T}\left| \left\langle V_{s}^{k,\alpha ,*},\nabla \varphi (s,\,\cdot \,)\right\rangle -\left\langle V_{s}^{k,*},\alpha (s)\cdot \nabla \varphi (s,\,\cdot \,)\right\rangle \right| ds\right] \\&\qquad +\mathbb {E}\left[ \int _{0}^{T}\left| \left\langle V^{N_{k}}*\left[ b(V_{s}^{k,*})S_{s}^{N_{k}}\right] ,\nabla \varphi (s,\,\cdot \,)\right\rangle \right. \right. \\&\left. \left. \qquad -\left\langle V_{s}^{k,*},b(\mathcal {P}_{\delta }(V_{s}^{k,*} ))\cdot \nabla \varphi (s,\,\cdot \,)\right\rangle \right| ds\right] \\&\quad \doteq (i)+(ii)+(iii). \end{aligned} \end{aligned}$$
(5.12)

In the previous equation, we use the following bound

$$\begin{aligned} \mathbb {E}\left[ \sup _{t\in \left[ 0,T\right] }\left| M_{t} ^{N_{k},V^{N_{k},-}*\varphi }\right| \right] \le C\mathbb {E}\left[ \left| M_{T}^{N_{k},V^{N_{k},-}*\varphi }\right| ^{2}\right] ^{1/2} \end{aligned}$$

due to Doob’s inequality. We now show that the terms \((i)-(iii)\) in Eq. (5.12) converge to zero as \(N_{k}\rightarrow \infty \) (and, for the last one, as \(\delta \rightarrow 0\)). First, \(\Vert V^{N_{k},-}*\nabla \varphi \Vert _{\infty }\le \Vert \nabla \varphi \Vert _{\infty }\) is bounded and, since \(\nabla \varphi \) is uniformly continuous, \(V^{N_{k},-} *\nabla \varphi \) converges uniformly to \(\nabla \varphi \). This implies that (5.12)-(i) converges to zero. Indeed,

$$\begin{aligned} \mathbb {E}\left[ \left( M_{T}^{N_{k},V^{N_{k},-}*\varphi }\right) ^{2}\right]= & {} \frac{1}{N_{k}^{2}}\sum _{i=1}^{N_{k}}\int _{0}^{T}\mathbb {E}\left[ \left| \left( V^{N_{k},-}*\nabla \varphi \right) (s,X_{s}^{N_{k},i})\right| ^{2}\right] ds\\\le & {} \frac{1}{N_{k}}\left\| V^{N_{k},-}*\nabla \varphi \right\| _{\infty }^{2}T. \end{aligned}$$

The uniform convergence of \(V^{N_{k},-}*\nabla \varphi \) and \(V^{N_{k},-} *\left( \alpha (\,\cdot \,) \cdot \nabla \varphi \right) \), the weak convergence of \(S_{s}^{N_{k}}\) (realized a.s. on an auxiliary probability space, by the Skorokhod representation theorem) and the Lebesgue dominated convergence theorem imply that also the term (5.12)-(ii) converges to zero. The convergence to zero of the third term is more delicate and will be proved in Step 4 below.

Step 4   Let us consider

$$\begin{aligned} \begin{aligned}&\mathbb {E}\left[ \int _{0}^{T}\left| \left\langle S_{s}^{N_{k}} ,b(V_{s}^{k,*})\cdot (V^{N_{k}}*\nabla \varphi )(s,\,\cdot \,)-V^{N_{k}}*\left[ b(\mathcal {P}_{\delta }(V_{s}^{k,*}))\cdot \nabla \varphi (s,\,\cdot \,)\right] \right\rangle \right| ds\right] \\&\quad \le \int _{0}^{T}\mathbb {E}\left[ \int _{\mathbb {R}^{d}}\int _{\mathbb {R} ^{d}}V^{N_{k}}\left( x-y\right) \left| b(\mathcal {P}_{\delta } (V_{s}^{k,*})(y))-b(V_{s}^{k,*}(x))\right| \left| \nabla \varphi \left( s, y\right) \right| \,dy\,S_{s}^{N_{k}}\left( dx\right) \right] \,ds\\&\quad \le L_{b}\int _{0}^{T}\mathbb {E}\left[ \int _{\mathbb {R}^{d}} \int _{\mathbb {R}^{d}}V^{N_{k}}\left( x-y\right) \left| \mathcal {P} _{\delta }(V_{s}^{k,*})(y)-V_{s}^{k,*}(x)\right| \left| \nabla \varphi \left( s, y\right) \right| dy\,S_{s}^{N_{k}}\left( dx\right) \right] ds\\&\quad \le L_{b}\int _{0}^{T}\mathbb {E}\left[ \int _{\mathbb {R}^{d}}\int _{\mathbb {R} ^{d}}V^{N_{k}}\left( x-y\right) \left| \mathcal {P}_{\delta } (V_{s}^{k,*})(y)-V_{s}^{k,*}(y)\right| \left| \nabla \varphi (s, y)\right| dy\,S_{s}^{N_{k}}(dx)\right] \,ds\\&\qquad +L_{b}\int _{0}^{T}\mathbb {E}\left[ \int _{\mathbb {R}^{d}}\int _{\mathbb {R}^{d} }V^{N_{k}}\left( x-y\right) \left| V_{s}^{k,*}\left( y\right) -V_{s}^{k,*}\left( x\right) \right| \left| \nabla \varphi \left( s, y\right) \right| dy\,S_{s}^{N_{k}}\left( dx\right) \right] ds. \end{aligned} \end{aligned}$$

We now establish two pointwise bounds (notice that we use the explicit expression of \(\mathcal {P}_{\delta }\) as an expectation over the Brownian increment \(W_{\delta }\)). The first is given by:

$$\begin{aligned} \begin{aligned}&\left| \mathcal {P}_{\delta }\left( V^{N_{k}}*S_{s}^{N_{k}}\right) \left( y\right) - (V^{N_{k}}*S_{s}^{N_{k}} )\left( y\right) \right| \\&\quad =\left| \mathbb {E}\left[ \left( V^{N_{k}}*S_{s}^{N_{k}}\right) \left( y+W_{\delta }\right) -\left( V^{N_{k}}*S_{s}^{N_{k}}\right) \left( y\right) \right] \right| \\&\quad \le \mathbb {E}\left[ \left| \left( V^{N_{k}}*S_{s}^{N_{k} }\right) \left( y+W_{\delta }\right) -\left( V^{N_{k}}*S_{s}^{N_{k} }\right) \left( y\right) \right| \right] \\&\quad \le \left[ V^{N_{k}}*S_{s}^{N_{k}}\right] _{\gamma }\mathbb {E}\left[ \left| W_{\delta }\right| ^{\gamma }\right] \\&\quad \le C_{\gamma }\left[ V^{N_{k}}*S_{s}^{N_{k}}\right] _{\gamma } \delta ^{\gamma /2}, \end{aligned} \end{aligned}$$

whereas the second follows because V has compact support, say contained in the unit ball, so that the support of \(V^{N_{k}}\) is contained in a ball of radius \(\epsilon _{N_{k}}\):

$$\begin{aligned} \begin{aligned}&V^{N_{k}}\left( x-y\right) \left| (V^{N_{k}}*S_{s}^{N_{k}})\left( y\right) - (V^{N_{k}}*S_{s}^{N_{k}})\left( x\right) \right| \\&\quad \le V^{N_{k}}\left( x-y\right) \left[ V^{N_{k}}*S_{s}^{N_{k}}\right] _{\gamma }\left| x-y\right| ^{\gamma }\\&\quad \le \epsilon _{N_{k}}^{\gamma }V^{N_{k}}\left( x-y\right) \left[ V^{N_{k} }*S_{s}^{N_{k}}\right] _{\gamma }. \end{aligned} \end{aligned}$$

Therefore

$$\begin{aligned} \begin{aligned}&\mathbb {E}\left[ \int _{0}^{T}\left| \left\langle S_{s}^{N_{k}} ,b(V_{s}^{k,*})\cdot (V^{N_{k}}*\nabla \varphi )(s,\,\cdot \,)-V^{N_{k}}*\left[ b(\mathcal {P}_{\delta }(V_{s}^{k,*}))\cdot \nabla \varphi (s,\,\cdot \,)\right] \right\rangle \right| ds\right] \\&\le C_{\gamma }\delta ^{\gamma /2}\int _{0}^{T}\mathbb {E}\left[ \left[ V^{N_{k}}*S_{s}^{N_{k}}\right] _{\gamma }\int _{\mathbb {R}^{d}} \int _{\mathbb {R}^{d}}V^{N_{k}}\left( x-y\right) \left| \nabla \varphi \left( s,y\right) \right| dy\,S_{s}^{N_{k}}\left( dx\right) \right] ds\\&+\epsilon _{N_{k}}^{\gamma }\int _{0}^{T}\mathbb {E}\left[ \left[ V^{N_{k} }*S_{s}^{N_{k}}\right] _{\gamma }\int _{\mathbb {R}^{d}}\int _{\mathbb {R} ^{d}}V^{N_{k}}\left( x-y\right) \left| \nabla \varphi \left( s, y\right) \right| dy\,S_{s}^{N_{k}}\left( dx\right) \right] ds\\&\le \left( C_{\gamma }\delta ^{\gamma /2}+\epsilon _{N_{k}}^{\gamma }\right) \left\| \nabla \varphi \right\| _{\infty }\int _{0}^{T}\mathbb {E}\left[ \left[ V^{N_{k}}*S_{s}^{N_{k}}\right] _{\gamma }\int _{\mathbb {R}^{d}} \int _{\mathbb {R}^{d}}V^{N_{k}}\left( x-y\right) dy\,S_{s}^{N_{k}}\left( dx\right) \right] ds\\&=\left( C_{\gamma }\delta ^{\gamma /2}+\epsilon _{N_{k}}^{\gamma }\right) \left\| \nabla \varphi \right\| _{\infty }\int _{0}^{T}\mathbb {E}\left[ \left[ V^{N_{k}}*S_{s}^{N_{k}}\right] _{\gamma }\right] ds, \end{aligned} \end{aligned}$$

which converges to zero as \(k\rightarrow \infty \) and then \(\delta \rightarrow 0\), thanks to the first estimate of Lemma 5.6. \(\square \)
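The key scaling \(\mathbb {E}\left[ \left| W_{\delta }\right| ^{\gamma }\right] = C_{\gamma }\delta ^{\gamma /2}\) used above is a consequence of Brownian scaling, \(W_{\delta }\overset{d}{=}\sqrt{\delta }\,Z\) with Z standard Gaussian. As an illustration only (not part of the proof), the following Python sketch checks this scaling numerically in dimension one; `exact_abs_moment` is the closed-form value of \(\mathbb {E}\left[ \left| Z\right| ^{\gamma }\right] \).

```python
import math
import random

def exact_abs_moment(gamma: float) -> float:
    # E|Z|^gamma = 2^(gamma/2) * Gamma((gamma+1)/2) / sqrt(pi), Z ~ N(0,1)
    return 2 ** (gamma / 2) * math.gamma((gamma + 1) / 2) / math.sqrt(math.pi)

def mc_abs_moment(delta: float, gamma: float, n: int = 100_000, seed: int = 0) -> float:
    # Monte Carlo estimate of E|W_delta|^gamma, using W_delta = sqrt(delta) * Z
    rng = random.Random(seed)
    s = math.sqrt(delta)
    return sum(abs(s * rng.gauss(0.0, 1.0)) ** gamma for _ in range(n)) / n

gamma, delta = 0.5, 0.01
exact = delta ** (gamma / 2) * exact_abs_moment(gamma)
estimate = mc_abs_moment(delta, gamma)
# doubling delta multiplies the exact moment by exactly 2^(gamma/2)
ratio = (2 * delta) ** (gamma / 2) / delta ** (gamma / 2)
```

The Monte Carlo value agrees with the closed form, and the ratio confirms the \(\delta ^{\gamma /2}\) rate.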

In order to complete the proof of Theorem 5.1, we have to prove that P is supported on a class of solutions of equation (5.3) to which the uniqueness result of Appendix 1 applies. We already know that P is supported on \(\Lambda \) and satisfies \(\int \left( \Phi _{\varphi }\left( \mu \right) \wedge 1\right) P\left( d\mu \right) =0\) for every \(\varphi \in \text {C}_{c}^{1,2}\left( [0,T]\times \mathbb {R}^{d}\right) \). On an auxiliary probability space \(\left( \widetilde{\Omega },\widetilde{\mathcal {F}},\widetilde{\mathbb {P}}\right) \) with expectation \(\widetilde{\mathbb {E}}\), let \((\widetilde{\mu }_{t})_{0 \le t \le T}\) be a process with law P. We know that

$$\begin{aligned} \widetilde{\mathbb {E}}\left[ \Phi _{\varphi }\left( \widetilde{\mu }\right) \wedge 1\right] =0 \end{aligned}$$

hence

$$\begin{aligned}&\sup _{t\in \left[ 0,T\right] }\left| \left\langle \widetilde{\mu } _{t},\varphi (t,\,\cdot \,)\right\rangle \right. \nonumber \\&\quad \left. -\left\langle \mu _{0},\varphi (0,\,\cdot \,)\right\rangle -\int _{0}^{t}\left\langle \widetilde{\mu }_{s}, \mathcal {A}\varphi (s,\,\cdot \,) -\left( \alpha (s)+b\left( \widetilde{p}_{s}\right) \right) \cdot \nabla \varphi (s,\,\cdot \,)\right\rangle ds\right| =0\nonumber \\ \end{aligned}$$
(5.13)

with \(\widetilde{\mathbb {P}}\)-probability one. The set \(\text {C}_{c}^{1,2}\left( [0,T]\times \mathbb {R}^{d}\right) \) is separable in its natural metric, and therefore we may find a dense countable family \(\mathcal {D}\subset \text {C}_{c}^{1,2}\left( [0,T]\times \mathbb {R}^{d}\right) \); it follows that we may reverse the quantifiers and get that, with \(\widetilde{\mathbb {P}}\)-probability one, identity (5.13) holds for all \(\varphi \in \mathcal {D}\). Obviously we can also write

$$\begin{aligned}&\sup _{t\in \left[ 0,T\right] }\left| \left\langle \widetilde{p}(t),\varphi (t,\,\cdot \,)\right\rangle -\left\langle p_0,\varphi (0,\,\cdot \,)\right\rangle \right. \\&\left. \quad -\int _{0}^{t}\left\langle \widetilde{p}(s), \mathcal {A}\varphi (s, \,\cdot \,) -\left( \alpha (s)+b\left( \widetilde{p}_{s}\right) \right) \cdot \nabla \varphi (s,\,\cdot \,)\right\rangle ds\right| =0 \end{aligned}$$

since \(\widetilde{\mu }_{t}\) has density \(\widetilde{p}_{t}\), and \(\mu _{0}\) has density \(p_{0}\) by assumption. From the density of \(\mathcal {D}\) and classical limit theorems we get that, with \(\widetilde{\mathbb {P}}\)-probability one, the previous identity holds for every \(\varphi \in \text {C}_{c}^{1,2}\left( [0,T]\times \mathbb {R}^{d}\right) \). Recall that we denote by \(G\left( t,x\right) \) the density of Brownian motion in \(\mathbb {R}^{d}\) and by \(\mathcal {P}_{t}\) the associated heat semigroup. From the previous identity we deduce

$$\begin{aligned} \left\langle \widetilde{p}(t),\psi (\,\cdot \,)\right\rangle =\left\langle \mathcal {P} _{t}p_0,\psi (\,\cdot \,)\right\rangle -\int _{0}^{t}\left\langle \nabla \mathcal {P} _{t-s}\left( \alpha (s)+b(\widetilde{p}(s))\right) ,\psi (\,\cdot \,)\right\rangle ds \end{aligned}$$
(5.14)

for every \(\psi \in \text {C}_{c}^{2}\left( \mathbb {R}^{d}\right) \). Indeed, given \(t\in \left[ 0,T\right] \) and \(\psi \in \text {C}_{c}^{2}\left( \mathbb {R}^{d}\right) \), consider the test function \(\varphi ^{(t)}(s)=\mathcal {P}_{t-s}\psi \) for \(s \in [0,t]\); by approximating it with functions of class \(\text {C}_{c}^{1,2}\left( [0,T]\times \mathbb {R}^{d}\right) \), we deduce

$$\begin{aligned} \begin{aligned} \left\langle \widetilde{p}(t),\mathcal {P}_{t-t}\psi \right\rangle&-\left\langle p_0,\mathcal {P}_{t-0}\psi \right\rangle \\&-\int _{0} ^{t}\left\langle \widetilde{p}(s), \mathcal {A}(\mathcal {P}_{t-\cdot }\psi )(s) -\left( \alpha (s)+b\left( \widetilde{p}(s)\right) \right) \cdot \nabla \mathcal {P}_{t-s}\psi \right\rangle ds =0, \end{aligned} \end{aligned}$$
(5.15)

which simplifies to

$$\begin{aligned} \left\langle \widetilde{p}(t),\psi \right\rangle =\left\langle p_0 ,\mathcal {P}_{t}\psi \right\rangle -\int _{0}^{t}\left\langle \alpha _{s}+b(\widetilde{p}(s)),\nabla \mathcal {P}_{t-s}\psi \right\rangle ds \end{aligned}$$

and therefore leads to equation (5.14) by simple manipulations. By the arbitrariness of \(\psi \) and the continuity in x of \(\widetilde{p}_{t}\) and of both \(\mathcal {P}_{t}f\) and \(\nabla \mathcal {P}_{t-s}f\) (the latter only for \(s<t\)) for every continuous bounded f (here we also use the bound \(\left\| \nabla \mathcal {P}_{t-s}f\right\| _{\infty }\le \frac{C}{\left( t-s\right) ^{1/2}}\left\| f\right\| _{\infty }\) and the integrability of \(\frac{C}{\left( t-s\right) ^{1/2}}\) on \([0,t]\)) we get

$$\begin{aligned} \widetilde{p}\left( t, x\right) =\left( \mathcal {P}_{t}p_0\right) \left( x\right) -\int _{0}^{t}\nabla \mathcal {P}_{t-s}\left( \alpha (s)+b(\widetilde{p}(s))\right) \left( x\right) ds. \end{aligned}$$

By the same arguments we deduce that \(\widetilde{p}\) is continuous in \(\left( t,x\right) \). Moreover, it is bounded uniformly in \(\left( t,x\right) \) by the identity itself, because \(\mathcal {P}_{\cdot }p_{0}\), \(\alpha \) and b are bounded, and again we use \(\left\| \nabla \mathcal {P}_{t-s}f\right\| _{\infty }\le \frac{C}{\left( t-s\right) ^{1/2}}\left\| f\right\| _{\infty }\). In conclusion, \(\widetilde{p}\) is of class \(\text {C}_{b}\left( \left[ 0,T\right] \times \mathbb {R}^{d}\right) \). In Appendix 1 it is proved that in this class there is a unique solution of the previous mild equation; hence P is supported on a single element. This completes the proof of Theorem 5.1.
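For the reader's convenience, here is a sketch (not a substitute for Appendix 1) of the standard estimates behind this uniqueness claim, under the assumption that b is Lipschitz. In our normalization,

$$\begin{aligned} G(t,x)=(2\pi t)^{-d/2}e^{-\left| x\right| ^{2}/(2t)},\qquad \left( \mathcal {P}_{t}f\right) \left( x\right) =\int _{\mathbb {R}^{d}}G(t,x-y)f(y)\,dy, \end{aligned}$$

and the bound \(\left\| \nabla \mathcal {P}_{t}f\right\| _{\infty }\le Ct^{-1/2}\left\| f\right\| _{\infty }\) follows from \(\int _{\mathbb {R}^{d}}\left| \nabla G(t,x)\right| dx\le Ct^{-1/2}\). If \(\widetilde{p}_{1},\widetilde{p}_{2}\in \text {C}_{b}\left( \left[ 0,T\right] \times \mathbb {R}^{d}\right) \) are two mild solutions with the same \(p_{0}\) and \(\alpha \), subtracting the two identities and using this bound gives

$$\begin{aligned} \left\| \widetilde{p}_{1}(t)-\widetilde{p}_{2}(t)\right\| _{\infty }\le CL\int _{0}^{t}\left( t-s\right) ^{-1/2}\left\| \widetilde{p}_{1}(s)-\widetilde{p}_{2}(s)\right\| _{\infty }ds, \end{aligned}$$

with L a Lipschitz constant of b, and a generalized Gronwall lemma for the weakly singular kernel \(\left( t-s\right) ^{-1/2}\) forces \(\widetilde{p}_{1}\equiv \widetilde{p}_{2}\).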

6 Approximate Nash Equilibria from the Mean Field Game

In this section we show that if we have a weak solution (u, p) of the PDE system in Eq. (4.1), then we can construct a sequence of approximate Nash equilibria for the corresponding N-player game. This is the content of the following theorem.

Theorem 6.1

Let \(N \in \mathbb {N}\), \(N > 1\). Grant (H1)-(H4). Suppose (u, p) is a weak solution of the PDE system in Eq. (4.1) and let \(\alpha ^{*}(t, x) \doteq - \triangledown u(t, x)\) be the optimal control of problem OC in the class \(\mathcal {A}^{fb}_{K}\), with K given by Definition (4.13). Set

$$\begin{aligned}&\alpha ^{N, i}(t, \mathbf {x}) \doteq \alpha ^{*}(t, x_i) \doteq -\triangledown u(t, x_i),\quad t \in [0,T],\nonumber \\&\mathbf {x} = \left( x_1, \ldots , x_N\right) \in \mathbb {R}^{d \times N},\,\,i \in {[[N]]} \end{aligned}$$
(6.1)

and \(\varvec{\alpha }^{N} = (\alpha ^{N,1}, \ldots , \alpha ^{N,N}){\in \mathcal {A}^{N;fb}_K}\). Then for every \(\varepsilon > 0\) there exists \(N_0 = N_0(\varepsilon ) \in \mathbb {N}\) such that \(\varvec{\alpha }^{N}\) is an \(\varepsilon \)-Nash equilibrium for the N-player game whenever \(N \ge N_0\).
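To make the construction in Eq. (6.1) concrete: each player applies the same feedback \(\alpha ^{*}\), evaluated only at her own coordinate of the joint state. The following Python sketch illustrates this decentralization; `grad_u` is a hypothetical stand-in for \(\triangledown u\), not the actual solution of Eq. (4.1).

```python
from typing import Callable, List, Sequence

Vec = List[float]

def make_decentralized_controls(grad_u: Callable[[float, Vec], Vec], n_players: int):
    # alpha^{N,i}(t, x) = alpha^*(t, x_i) = -grad_u(t, x_i): player i reads
    # only her own state x_i out of the joint state x = (x_1, ..., x_N).
    def alpha_i(i: int):
        def control(t: float, x: Sequence[Vec]) -> Vec:
            return [-g for g in grad_u(t, x[i])]
        return control
    return [alpha_i(i) for i in range(n_players)]

# hypothetical smooth value-function gradient, for illustration only
grad_u = lambda t, x: [2.0 * (1.0 - t) * xi for xi in x]
controls = make_decentralized_controls(grad_u, n_players=3)
joint_state = [[1.0, 0.0], [0.5, -0.5], [0.0, 2.0]]
a1 = controls[1](0.5, joint_state)  # player 1 uses only x_1 = [0.5, -0.5]
```

Changing the other players' coordinates leaves `a1` unchanged, which is exactly the decentralized structure of (6.1).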

Proof

The proof is divided into three steps.

Step 1  Let \(((\Omega _{N}, \mathcal {F}_{N}, (\mathcal {F}_t^{N}), \mathbb {P}^N), \varvec{W}^{N}, \varvec{X}^{N})\) be a weak solution of Eq. (3.1) under the strategy vector \(\varvec{\alpha }^{N}\). We note that the function F defined in (5.1) with \(\alpha (s, X_s^{N,i}) = - \triangledown u(s, X_s^{N,i})\) is continuous and bounded; this guarantees the existence of a weak solution of the system in Eq. (3.1) for any \(N\in \mathbb {N}\). Let \(S^{N}_t\) (resp. \(S^{N}\)) denote the associated empirical measure on \(\mathbb {R}^{d}\) (resp. on the path space \(\mathcal {X}\)). We are going to show that

$$\begin{aligned} \lim _{N \rightarrow \infty } J^{N}_{i}(\varvec{\alpha }^{N}) = J(\alpha ^{*}). \end{aligned}$$
(6.2)

Theorem 5.1-(i) enables us to prove the convergence result in Eq. (6.2) for the following simplified cost functional, for which, with a slight abuse, we keep the same notation:

$$\begin{aligned} J_i^{N}(\varvec{\alpha }^{N}) = \mathbb {E}\left[ \int _{0}^{T}\frac{1}{2}|-\triangledown u(s, X_s^{N,i})|^2\,ds + g(X_T^{N,i})\right] . \end{aligned}$$

Symmetry of the coefficients allows us to re-write the previous cost functional in terms of \(S^{N}_{t}\), \(t \in [0,T]\), as

$$\begin{aligned} \begin{aligned} J_i^{N}(\varvec{\alpha }^{N})&= \mathbb {E}\left[ \int _{0}^{T}\frac{1}{N}\sum _{j=1}^{N}\frac{1}{2}|-\triangledown u(s, X_s^{N,j})|^2\,ds + \frac{1}{N}\sum _{j=1}^{N} g(X_T^{N,j})\right] \\&= \mathbb {E}\left[ \int _{0}^{T} \langle S_s^{N}, \frac{1}{2}|-\triangledown u(s,\,\cdot \,)|^2\rangle \,ds + \langle S_T^{N}, g(\,\cdot \,)\rangle \right] \\ \end{aligned} \end{aligned}$$

which converges, as \(N \rightarrow \infty \), to

$$\begin{aligned} \begin{aligned} {\int _{0}^{T} \langle S_s^{\infty }, \frac{1}{2}|-\triangledown u(s,\,\cdot \,)|^2\rangle \,ds + \langle S_T^{\infty }, g(\,\cdot \,)\rangle } \end{aligned} \end{aligned}$$

where \(S^{\infty }\) is the deterministic limit in probability of the sequence of random empirical measures \((S^{N})_{N\in \mathbb {N}}\) given by Theorem 5.1-(i).

We claim that \(S_t^{\infty } \equiv p(t,\,\cdot \,)\), \(t \in [0, T]\), with p the second component of the pair (u, p), i.e. the density of the solution of Eq. (4.15), as stated by the Verification Theorem 4.8. Theorem 5.1 states that, given \(\varvec{\alpha }^{N}\), the empirical measure \(S_t^{N}\) corresponding to the interacting system with this control converges to a flow of measures with density \(p^{\alpha }(t,\,\cdot \,)\), where we stress the dependence on \(\alpha \). In addition, Theorem 5.1-(ii) states that \(p^{\alpha }(t,\,\cdot \,)\) is the mild solution of Eq. (5.3). Applying this result to the optimal control, we get that the corresponding empirical measure on \(\mathbb {R}^{d}\) converges to \(p^{\alpha ^{*}}(t,\,\cdot \,)\), mild solution of Eq. (5.3). Also p, the second component of (u, p), is a mild solution of this equation. The uniqueness Theorem 4.5 now implies that \(p^{\alpha ^{*}}(t,\,\cdot \,)\) coincides with \(p(t,\,\cdot \,)\). Hence, we can conclude that Eq. (6.2) holds.
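The symmetrization step above rests on the elementary identity \(\frac{1}{N}\sum _{j=1}^{N} f(x_{j}) = \langle S^{N}, f\rangle \) for the empirical measure \(S^{N} = \frac{1}{N}\sum _{j=1}^{N}\delta _{x_{j}}\), together with its invariance under relabelling of the players. A minimal Python sketch of this bookkeeping, with an arbitrary integrand standing in for \(\frac{1}{2}|\triangledown u|^{2}\):

```python
def empirical_pairing(f, points):
    # <S^N, f> with S^N = (1/N) * sum_j delta_{x_j}
    return sum(f(x) for x in points) / len(points)

points = [0.1, -0.4, 2.0, 0.7]        # player positions (d = 1 for simplicity)
f = lambda x: 0.5 * x * x             # stand-in running cost
average_cost = sum(f(x) for x in points) / len(points)
paired_cost = empirical_pairing(f, points)
```

The pairing equals the per-player average and does not depend on the ordering of the players, which is what allows rewriting \(J_i^{N}\) through \(S^{N}_{t}\).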

Step 2 For each \(N \in \mathbb {N}\), \(N > 1\), let \(\beta ^{N,i} \in \mathcal {A}^{N;1;fb}_{K}\) be such that

$$\begin{aligned} J^{N}_i([\varvec{\alpha }^{N,-i}, {\beta ^{N,i}}]) \le \inf _{\beta \in {\mathcal {A}^{N;1;fb}_{K}}} J^{N}_{i}([\varvec{\alpha }^{N,-i}, \beta ]) + \varepsilon /2. \end{aligned}$$

We are going to show the following result:

$$\begin{aligned} \liminf _{N \rightarrow \infty } J^{N}_i([\varvec{\alpha }^{N,-i}, {\beta ^{N,i}}]) \ge J(\alpha ^{*}). \end{aligned}$$
(6.3)

To this aim, we introduce the N-player dynamics in the case where only the first player deviates from the Nash equilibrium. For \(N \in \mathbb {N}\), consider the system of equations:

$$\begin{aligned} \begin{aligned} X_t^{N,1;\beta }&= {X_0}^{N,1}+\int _0^t \left( {\beta ^{N,1}(s,\varvec{X}^{N;\beta }_s)} + b(X_s^{N,1;\beta },\frac{1}{N}\sum _{j=1}^NV^N(X_s^{N,1;\beta }-X_s^{N,j;\beta }))\right) \,ds\\&\quad + {W_t^{N,1;\beta }}\\ X_t^{N,i;\beta }&= {X_0}^{N,i}+\int _0^t\left( \alpha ^*(s,X^{N,i;\beta }_s) + b(X_s^{N,i;\beta },\frac{1}{N}\sum _{j=1}^NV^N(X_s^{N,i;\beta }-X_s^{N,j;\beta }))\right) \,ds\\&\quad + {W^{N, i;\beta }_t}\\&\quad \quad \quad i\in \left\{ 2, \ldots , N\right\} ,\,\,t \in [0, T], \end{aligned} \end{aligned}$$
(6.4)

where \(\beta ^{N,1} \in {\mathcal {A}^{N;1;fb}_{K}}\). We denote with \(S^{N;\beta }\doteq (S^{N;\beta }_t)_{t \in [0, T]}\) the empirical measure process on \(\mathbb {R}^{d}\) of the previous system.

Now, for each \(N \in \mathbb {N}\), let \(((\Omega _{N}, \mathcal {F}_{N}, (\mathcal {F}_t^{N}), \mathbb {Q}^N), {\varvec{W}^{N;\beta }}, \varvec{X}^{N;\beta })\) be a weak solution of Eq. (6.4). Since the presence of a deviating player destroys the symmetry of the pre-limit system, following the proof of Theorem 3.10 in Lacker [15], we perform a change of measure to restore it. More precisely, we define as \(\mathbb {P}^{N}\) the probability measure under which \(\varvec{X}^{N;\beta }\) has the following dynamics:

$$\begin{aligned} \begin{aligned} X_t^{N,{i};\beta }&= {X_0}^{N,{i}}+\int _0^t \left( \alpha ^*(s,X^{N,i;\beta }_s) + b(X_s^{N,{i};\beta },\frac{1}{N}\sum _{j=1}^NV^N(X_s^{N,{i};\beta }-X_s^{N,j;\beta }))\right) \,ds\\&\quad \,\, + {\widehat{W}_t^{N,i;\beta }}, \quad \quad \quad i\in {[[N]]},\,\,t \in [0, T], \end{aligned} \end{aligned}$$

where the \(\widehat{W}_t^{N,i;\beta }\) are \(\mathbb {P}^N\)-Wiener processes, i.e. \(\mathbb {P}^N\) is defined via \(\frac{d\mathbb {P}^{N}}{d\mathbb {Q}^{N}}\Big \vert _{t=T} \doteq Z_T^N\) where

$$\begin{aligned} {Z^N_t \doteq \mathcal {E}_t\left( \int _0^{\cdot }\left( \varvec{\beta }^N(s,\varvec{X}^{N;\beta }_s)-\varvec{\alpha }^N(s,\varvec{X}^{N;\beta }_s)\right) d\varvec{W}_s^{N;\beta } \right) } \end{aligned}$$

where \(\varvec{\beta }^N=[\varvec{\alpha }^{N,-1},\beta ^{N,1}]\) and \(\varvec{W}^{N;\beta }=(W^{N,1;\beta },\ldots ,W^{N,N;\beta })\). We notice that \(Z^N\) is a well-defined \(\mathbb {Q}^N\)-martingale thanks to the boundedness of the coefficients. Theorem 5.1-(i) ensures the convergence under \(\mathbb {P}^{N}\) of \(S^{N;\beta }\) to \(S^{\infty } \equiv \delta _p\). Boundedness of the coefficients also gives uniform integrability of the sequence \(((Z^{N}_T)^{-1})_{N \in \mathbb {N}}\); therefore, the probability measures \(\mathbb {Q}^{N}(A) \doteq \mathbb {E}^{\mathbb {P}^{N}}\left[ {(Z^{N}_T)^{-1}}\mathsf {1}_{A}\right] \), \(A \in \mathcal {F}^{N}\), assign vanishing mass to any sequence of events whose \(\mathbb {P}^{N}\)-probability converges to zero as \(N \rightarrow \infty \). So the convergence (in law and also in probability) of \(S^{N;\beta }\) to \(S^{\infty }\) under \(\mathbb {P}^{N}\) implies its convergence (in law and also in probability) under \(\mathbb {Q}^{N}\) to the same (constant) limit.
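For completeness, recall that the stochastic (Doléans) exponential above reads explicitly as

$$\begin{aligned} Z^N_t=\exp \left( \int _0^{t}\left( \varvec{\beta }^N-\varvec{\alpha }^N\right) (s,\varvec{X}^{N;\beta }_s)\cdot d\varvec{W}_s^{N;\beta }-\frac{1}{2}\int _0^{t}\left| \left( \varvec{\beta }^N-\varvec{\alpha }^N\right) (s,\varvec{X}^{N;\beta }_s)\right| ^{2}ds\right) , \end{aligned}$$

and, since both \(\varvec{\alpha }^N\) and \(\varvec{\beta }^N\) take values in a bounded set, Novikov's condition is satisfied, so that \(Z^N\) is indeed a true \(\mathbb {Q}^N\)-martingale, as claimed.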

Now, in order to gain more compactness in the space of admissible controls, we interpret the controls in Eq. (6.4) as stochastic relaxed controls (Appendix 1). To this end, we denote by \({\overline{B}_{K}(0)} \subset \mathbb {R}^{d}\) the closed ball of radius K around the origin and set \(\mathcal {R}_K \doteq \mathcal {R}_{\overline{B}_{K}(0)}\). Then \({\mathcal {R}_{K}}\) is compact (Appendix 1). For \(N \in \mathbb {N}\), let \(\tilde{\beta }_{t}^{N,1}\) and \(\tilde{\alpha }_{t}^{*,i}\), \(i \in \{2, \ldots , N\}\), be \({\mathcal {R}_{K}}\)-valued random measures determined by:

$$\begin{aligned} \begin{aligned}&{\tilde{\beta }_{t}^{N,1}(dx)\,dt \doteq \delta _{\beta ^{N,1}(t,\varvec{X}_t^{N;\beta })}(dx)\,dt,\quad \quad \quad \,\,\,\,(t, x) \in [0,T] \times \overline{B}_{K}(0)}\\&\tilde{\alpha }_{t}^{*,i}(dx)\,dt \doteq \delta _{\alpha ^{*}(t, X_t^{N, i; \beta })}(dx)\,dt\quad \,\,\,\, (t, x) \in [0,T] \times {\overline{B}_{K}(0)},\,\,i\in \{2,\ldots ,N\}.\\ \end{aligned} \end{aligned}$$

We rewrite Eq. (6.4) in terms of these relaxed controls:

$$\begin{aligned} \begin{aligned} X_t^{N,1;\beta }&= {X_0}^{N,1} + \int _{[0,t]\times \overline{B}_{{K}}(0)} x \tilde{\beta }_{s}^{N,1}(dx)\,ds\\&\quad + \int _{0}^{t} b(X_s^{N,1;\beta },\frac{1}{N}\sum _{j=1}^NV^N(X_s^{N,1;\beta }-X_s^{N,j;\beta }))\,ds + {W_t^{N,1;\beta }}\\ X_t^{N,i;\beta }&= {X_0}^{N,i} + \int _{[0,t]\times \overline{B}_{{K}}(0)} x \tilde{\alpha }_{s}^{*,i}(dx){\,ds}\\&\quad + \int _{0}^{t} b(X_s^{N,i;\beta },\frac{1}{N}\sum _{j=1}^NV^N(X_s^{N,i;\beta }-X_s^{N,j;\beta })) \,ds + {W^{N, i;\beta }_t}\\&\quad \quad \quad i\in \left\{ 2, \ldots , N\right\} ,\,\,t \in [0, T]. \end{aligned} \end{aligned}$$
(6.5)
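This rewriting does not change the dynamics: for a Dirac relaxed control \(\tilde{\beta }_{s}(dx)=\delta _{\beta (s)}(dx)\), the barycenter \(\int x\,\tilde{\beta }_{s}(dx)\) recovers \(\beta (s)\) exactly. A time-discretized Python sketch of this identity, with relaxed controls represented as finite weighted point masses (names are illustrative):

```python
def barycenter(relaxed_control):
    # integral of x against a discrete measure given as [(weight, point), ...]
    return sum(w * x for w, x in relaxed_control)

def drift_integral(controls, dt):
    # time-discretized version of  int_0^t ( int x rho_s(dx) ) ds
    return sum(barycenter(rho) * dt for rho in controls)

dt = 0.1
beta = [0.3, -1.2, 0.8]                # a strict control on 3 time steps
relaxed = [[(1.0, b)] for b in beta]   # the same control as Dirac measures
strict_integral = sum(b * dt for b in beta)
relaxed_integral = drift_integral(relaxed, dt)
```

A genuinely relaxed (non-Dirac) control may spread mass, e.g. half on \(-1\) and half on \(+1\), giving barycenter zero; compactness of \(\mathcal {R}_{K}\) is what this extra freedom buys.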

We make the following claims. Claim a: the family \(\left( \mathbb {P}^{N} \circ (X^{N, 1; \beta }, {\tilde{\beta }^{N,1}}, S^{N;\beta })^{-1}\right) _{N \in \mathbb {N}}\) is tight in \(\mathcal {P}(\mathcal {X} \times {\mathcal {R}_{K}} \times \mathcal {P}(\mathbb {R}^{d}))\) and thus it admits a convergent subsequence. We denote by \((X^{\tilde{\beta }^{*}}, \tilde{\beta }^{*,1},\,p)\) the limit of such a subsequence, which can be constructed by means of Skorokhod’s representation theorem on a suitable limiting probability space \((\Omega ^{\beta ^*},\mathcal {F}^{\beta ^*},\mathbb {Q}^{\beta ^*})\). Claim b: the limit \(X^{\tilde{\beta }^{*}}\) has the following representation:

$$\begin{aligned} X_t^{\tilde{\beta }^{*}}= & {} {X_0} + \int _{[0,t]\times \overline{B}_{{K}}(0)} x \tilde{\beta }^{*,1}_{s}(dx)\,ds + \int _{0}^{t} b(X_s^{\tilde{\beta }^{*}}, p(s, X_s^{\tilde{\beta }^{*}}))\,ds + {W^{\beta ^*}_t},\nonumber \\&\quad t \in [0,T] \end{aligned}$$
(6.6)

on \((\Omega ^{\beta ^*},\mathcal {F}^{\beta ^*},\mathbb {Q}^{\beta ^*})\), i.e. there exist a filtration \((\mathcal {F}^{\beta ^*}_t)\) and an \((\mathcal {F}^{\beta ^*}_t)\)-Wiener process \(W^{\beta ^*}\) on \((\Omega ^{\beta ^*},\mathcal {F}^{\beta ^*},\mathbb {Q}^{\beta ^*})\) such that \(X^{\tilde{\beta }^{*}}\) has representation (6.6). If both Claim a and Claim b hold, by setting \(\beta _t^{*} \doteq \int _{\overline{B}_{{K}}(0)} x \tilde{\beta }^{*,1}_{t}(dx)\), we have that \(J^{N}_i([\varvec{\alpha }^{N,-i}, {\beta ^{N,i}}])\) converges to

$$\begin{aligned} J(\beta ^{*}) = \mathbb {E}\left[ \int _{0}^{T} \frac{1}{2} |\beta ^{*}_s|^2\,ds + g(X_T^{\beta ^{*}})\right] \end{aligned}$$

along the selected subsequence, with \(J(\beta ^{*}) \ge J(\alpha ^{*})\) since \(\alpha ^{*}\) is optimal. Equation (6.3) then follows by taking the limit inferior of the sequence.

We now prove the two claims.

Proof of Claim a. Tightness of \((X^{N,1;\beta })_{N \in \mathbb {N}}\) and of \((S^{N;\beta })_{N \in \mathbb {N}}\) under \(\mathbb {Q}^{N}\) follows from their tightness under \(\mathbb {P}^{N}\). On the other hand, \((\mathbb {P}^{N} \circ ({\tilde{\beta }^{N,1}})^{-1})\) is tight in \(\mathcal {P}({\mathcal {R}_{K}})\) because \({\mathcal {R}_{K}}\) is compact. This implies that \(\left( \mathbb {P}^{N} \circ (X^{N, 1; \beta }, \tilde{\beta }^{N,1}, S^{N;\beta })^{-1}\right) _{N \in \mathbb {N}}\) is tight in \(\mathcal {P}(\mathcal {X} \times {\mathcal {R}_{K}} \times \mathcal {P}(\mathbb {R}^{d}))\).

Proof of Claim b. We use a characterization of solutions to Eq.  (6.6) with fixed measure variable through a martingale problem in the sense of Stroock and Varadhan [24] (see El Karoui and Méléard [9] for a study of the martingale problems we employ). Let \(f \in \text {C}_{c}^{2}(\mathbb {R}^{d})\) and let us define the process \(M^{f}\) on \((\mathcal {X}\times {\mathcal {R}_{K}}, \mathcal {B}(\mathcal {X}\times {\mathcal {R}_{K}}))\) by

$$\begin{aligned} \begin{aligned} M^{f}_t(\varphi , \rho )&\doteq f(\varphi (t))-f(\varphi (0)) -\int _{[0,t] \times \overline{B}_{{K}}(0)} x \rho _s(dx){\nabla f(\varphi (s))}\,ds\\&\quad - \int _{0}^{t}\left( b(\varphi (s), p(s, \varphi (s))) \nabla f(\varphi (s)) + \frac{1}{2}\Delta f(\varphi (s))\right) \,ds, \end{aligned} \end{aligned}$$
(6.7)

where \(t \in [0, T]\). We claim that \(\Theta ^{*}\doteq \mathbb {Q}^{\beta ^*} \circ (X^{\tilde{\beta }^{*}}, \tilde{\beta }^{*})^{-1}\in \mathcal {P}(\mathcal {X} \times \mathcal {R}_K)\) is a solution of the martingale problem associated to Eq. (6.7), i.e. that for all \(f \in \text {C}_{c}^{2}(\mathbb {R}^{d})\), \(M^f\) is a \(\Theta ^*\)-martingale. The martingale property is intended on \((\mathcal {X} \times {\mathcal {R}_{K}}, \mathcal {B}(\mathcal {X} \times {\mathcal {R}_{K}}))\) with respect to the \(\Theta ^{*}\)-augmentation of the canonical filtration, made right continuous by a standard procedure. However, to conclude it is sufficient to check that the martingale property holds with respect to the canonical filtration on \(\mathcal {X} \times {\mathcal {R}_{K}}\) (see, for instance, Problem 5.4.13 in Karatzas and Shreve (1998)). We denote by \((\mathcal {G}_t)_{t \in [0,T]}\) such a filtration and show that the process in Eq. (6.7), which is bounded, measurable and \(\mathcal {G}_t\)-adapted, is a \(\Theta ^{*}\)-martingale for all \(f \in \text {C}_{c}^{2}(\mathbb {R}^{d})\). This is equivalent to having

$$\begin{aligned} \mathbb {E}^{\Theta ^{*}}\left[ Y\cdot (M^{f}_{t_2}-M^{f}_{t_1})\right] = 0 \end{aligned}$$

for every choice of \((t_1, t_2, Y) \in [0,T]^{2} \times \text {C}_b(\mathcal {X} \times {\mathcal {R}_{K}})\) such that \(t_1 \le t_2\) and Y is \(\mathcal {G}_{t_1}\)-measurable. To this aim, we define and compute the following function \(\Psi ^{p} = \Psi ^{p}_{(t_1, t_2, Y, f)}:\mathcal {P}(\mathcal {X} \times {\mathcal {R}_{K}})\rightarrow \mathbb {R}\):

$$\begin{aligned} \begin{aligned} \Psi ^{p}(\Theta ^{*})&= \Psi ^{p}_{(t_1, t_2, Y, f)}(\Theta ^{*}) \doteq \mathbb {E}^{\Theta ^{*}}\left[ Y\cdot (M^{f}_{t_2}-M^{f}_{t_1})\right] \\&= \int _{\mathcal {X} \times \mathcal {R}_K}Y(\varphi ,\rho )\left( f(\varphi (t_2))-f(\varphi (t_1)) \right) \,\Theta ^*(d\varphi ,d\rho )\\&\quad -\int _{\mathcal {X} \times \mathcal {R}_K}Y(\varphi ,\rho )\int _{\overline{B}_{K}(0)\times [t_1,t_2]}x\rho _t(dx)\nabla f(\varphi (t)) dt\,\Theta ^*(d\varphi ,d\rho )\\&\quad -\int _{\mathcal {X} \times \mathcal {R}_K}Y(\varphi ,\rho )\int _{t_1}^{t_2} b(\varphi (t),p(t,\varphi (t))) \nabla f(\varphi (t))dt\,\Theta ^*(d\varphi ,d\rho )\\&\quad -\frac{1}{2} \int _{\mathcal {X} \times \mathcal {R}_K} Y(\varphi ,\rho )\int _{t_1}^{t_2}\Delta f(\varphi (t))dt\,\Theta ^*(d\varphi ,d\rho ). \end{aligned} \end{aligned}$$
(6.8)

The previous function, in particular, is continuous with respect to weak convergence of measures, since the integrands are bounded and continuous on \(\mathcal {X} \times {\mathcal {R}_{K}}\). Also, we define:

$$\begin{aligned} \begin{aligned} \overline{M}^{f,i}_t(\varphi ^N,\rho ^N)&\doteq f(\varphi ^{N,i}(t))-f(\varphi ^{N,i}(0))\\&\quad -\int _0^t\left[ \int _{\overline{B}_{K}(0)}x\rho ^{N,i}_s(dx)+b(\varphi ^{N,i}(s), v_i(\varphi ^N(s)))\right] \nabla f(\varphi ^{N,i}(s))\\&\quad +\frac{1}{2}\Delta f(\varphi ^{N,i}(s)) ds\\&v_i(\varphi ^N(t)) \doteq \frac{1}{N}\sum _{j=1}^NV^N(\varphi ^{N,i}(t)-\varphi ^{N,j}(t)),\quad t\in [0,T],\quad {i\in [[N]]}, \end{aligned} \end{aligned}$$

for \((\varphi ^{N}, \rho ^{N}) \in \mathcal {X}^{\times N} \times \mathcal {R}_{K}^{\times N}\), where \(\varphi ^{N,i}\) and \(\rho ^{N,i}\) are the \(i^{th}\) components of \(\varphi ^{N}\) and \(\rho ^{N}\), respectively, and the extended empirical measure \(\overline{S}^{N;\beta }\) as

$$\begin{aligned} \overline{S}^{N;\beta } \doteq \frac{1}{N} \sum _{i = 1}^{N} \delta _{(X^{N,i;\beta }, \rho ^{N, i; \beta })}. \end{aligned}$$

Here, \(X^{N, i; \beta }\) denotes the dynamics of player i in the system where only the first player deviates from the Nash equilibrium, written in terms of the relaxed controls \(\rho ^{N, i;\beta }\).

Now, by construction, it holds that

$$\begin{aligned} \frac{1}{N}\sum _{i=1}^{N} \mathbb {E}^{\overline{\Theta }^*_N}\left[ \overline{Y}^{i}\cdot \left( \overline{M}^{f,i}_{t_2}-\overline{M}_{t_1}^{f,i} \right) \right] = 0, \end{aligned}$$
(6.9)

where \(\overline{\Theta }^*_N \doteq \mathbb {P}^{N} \circ (X^{N, i; \beta }, \rho ^{N,i;\beta })^{-1}\), for every choice of \((t_1, t_2, \overline{Y}^i) \in [0,T]^{2} \times \text {C}_b(\mathcal {X}^{\times N} \times {\mathcal {R}_{K}^{\times N}})\) such that \(t_1 \le t_2\) and \(\overline{Y}^i\) is \(\mathcal {G}^N_{t_1}\)-measurable, with \((\mathcal {G}^N_t)\) being the canonical filtration on \(\mathcal {X}^{\times N} \times {\mathcal {R}_{K}^{\times N}}\). To conclude, it then suffices to show that the previous term converges to \(\Psi ^{p}(\Theta ^{*})\) in the limit \(N \rightarrow \infty \). Let us set \(\overline{Y}^{i}(\varphi ^{N}, \rho ^{N}) \doteq Y(\varphi ^{N,i}, \rho ^{N,i})\) and show that the following decomposition of the term in Eq. (6.9) holds:

$$\begin{aligned} \frac{1}{N}\sum _{i=1}^{N} \mathbb {E}^{\overline{\Theta }^*_N}\left[ Y\cdot \left( \overline{M}^{f,i}_{t_2}-\overline{M}_{t_1}^{f,i} \right) \right] = \mathbb {E}\left[ \Psi ^{p}_{(t_1, t_2, Y, f)}(\overline{S}^{N;\beta })\right] - \Delta ^{p}_{(t_1, t_2, Y, f)}(\overline{S}^{N;\beta }). \end{aligned}$$
(6.10)

Indeed, the first term is equal to:

$$\begin{aligned} \begin{aligned}&\mathbb {E}\left[ \Psi ^{p}_{(t_1, t_2, Y, f)}(\overline{S}^{N;\beta })\right] \\&\quad = \frac{1}{N}\sum _{i=1}^N\mathbb {E} \left[ Y(X^{N,i;\beta },\rho ^{N,i;\beta })\left( f(X^{N,i;\beta }_{t_2})-f(X^{N,i;\beta }_{t_1}) \right) \right] \\&\qquad -\frac{1}{N}\sum _{i=1}^N\mathbb {E}\left[ Y(X^{N,i;\beta },\rho ^{N,i;\beta })\int _{ \overline{B}_{{K}}(0) \times [t_1,t_2]}x\rho ^{N,i;\beta }_t(dx)\nabla f(X^{N,i;\beta }_t) dt\right] \\&\qquad -\frac{1}{N}\sum _{i=1}^N\mathbb {E}\left[ Y(X^{N,i;\beta },\rho ^{N,i;\beta })\int _{t_1}^{t_2} b(X^{N,i;\beta }_t,p(t,X^{N,i;\beta }_t)) \nabla f(X^{N,i;\beta }_t)dt \right] \\&\qquad -\frac{1}{N}\sum _{i=1}^N\mathbb {E}\left[ Y(X^{N,i;\beta },\rho ^{N,i;\beta })\int _{t_1}^{t_2}\frac{1}{2}\Delta f(X^{N,i;\beta }_t)dt \right] ,\\ \end{aligned} \end{aligned}$$
(6.11)

whereas the second reads as:

$$\begin{aligned} \begin{aligned}&\Delta ^{p}_{(t_1, t_2, Y, f)}(\overline{S}^{N;\beta })\\&\quad = \frac{1}{N}\sum _{i=1}^N\mathbb {E}\left[ Y(X^{N,i;\beta },\rho ^{N,i;\beta })\int _{t_1}^{t_2} b(X^{N,i;\beta }_t,v(X^{N;\beta }_t) ) \nabla f(X^{N,i;\beta }_t)dt \right] \\&\qquad - \frac{1}{N}\sum _{i=1}^N\mathbb {E}\left[ Y(X^{N,i;\beta },\rho ^{N,i;\beta })\int _{t_1}^{t_2} b(X^{N,i;\beta }_t,p(t,X^{N,i;\beta }_t)) \nabla f(X^{N,i;\beta }_t)dt \right] \end{aligned} \end{aligned}$$
(6.12)

In particular, \(\Psi ^{p}_{(t_1, t_2, Y, f)}(\overline{S}^{N; \beta })\) corresponds to the integrals in Eq. (6.8) computed w.r.t. the extended empirical measure \(\overline{S}^{N; \beta }\). The term in Eq. (6.11) converges to \(\Psi ^{p}_{(t_1, t_2, Y, f)}(\Theta ^{*})\) in the limit \(N \rightarrow \infty \), thanks to the weak continuity of the involved functional and the weak convergence of the measures. The term in Eq. (6.12), instead, vanishes in the limit \(N \rightarrow \infty \) thanks to Lemma D.2, since it can be bounded by:

$$\begin{aligned} \begin{aligned}&\left| \Delta ^{p}_{(t_1, t_2, Y, f)}(\overline{S}^{N;\beta }) \right| \\&\quad \le \frac{1}{N}\sum _{i=1}^N\mathbb {E}\left[ \left| Y(X^{N,i;\beta },\rho ^{N,i;\beta })\right| \int _{t_1}^{t_2}\left| b(X_s^{N,i;\beta },p^N(s,X^{N,i;\beta }_s))\right. \right. \\&\left. \left. \quad \quad -b(X_s^{N,i;\beta },p(s,X^{N,i;\beta }_s) )\right| \left| \nabla f(X^{N,i;\beta }_s)\right| ds \right] \\&\quad \le \frac{1}{N}\sum _{i=1}^N \Vert Y\Vert _{\infty }\Vert \nabla f\Vert _{\infty }L\int _{t_1}^{t_2} \Vert p^N(s,\cdot )-p(s,\cdot )\Vert _{\infty }ds. \end{aligned} \end{aligned}$$
(6.13)

We conclude that \(\Theta ^{*}\in \mathcal {P}(\mathcal {X} \times \mathcal {R}_K)\) solves the martingale problem associated to Eq. (6.7). By an argument analogous to that in the proofs of Proposition 5.4.6 and Corollary 5.4.8 in Karatzas and Shreve (1998), we finally conclude that there exists a weak solution \(((\Omega ^{\beta ^*},\mathcal {F}^{\beta ^*},\mathbb {Q}^{\beta ^*}), X^{\tilde{\beta }^{*}},W^{\beta ^*})\) of Eq. (6.6).
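The orthogonality relation \(\mathbb {E}^{\Theta ^{*}}\left[ Y\cdot (M^{f}_{t_2}-M^{f}_{t_1})\right] =0\) tested above can be illustrated numerically in the simplest possible setting: for Brownian motion alone (no drift, no control) and \(f(x)=x^{2}\), the compensated process \(M^{f}_{t}=W_{t}^{2}-t\) is a martingale, so its increment is uncorrelated with any bounded \(\mathcal {G}_{t_1}\)-measurable Y. A Monte Carlo sketch of this fact (an illustration only, not tied to the dynamics (6.6)):

```python
import math
import random

rng = random.Random(42)
t1, t2, n = 0.5, 1.0, 200_000
acc = 0.0
for _ in range(n):
    w1 = math.sqrt(t1) * rng.gauss(0.0, 1.0)            # W_{t1}
    w2 = w1 + math.sqrt(t2 - t1) * rng.gauss(0.0, 1.0)  # W_{t2}
    y = math.tanh(w1)                                    # bounded, G_{t1}-measurable
    incr = (w2 * w2 - t2) - (w1 * w1 - t1)               # M^f_{t2} - M^f_{t1} for f(x) = x^2
    acc += y * incr
estimate = acc / n  # should be close to zero
```

Here the correction term \(-t\) plays the role of the \(\frac{1}{2}\Delta f\) integral in (6.7), since \(\frac{1}{2}\Delta f\equiv 1\) for \(f(x)=x^{2}\).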

Step 3 For every \(N \in \mathbb {N}\),

$$\begin{aligned} \begin{aligned}&J^{N}_{i}(\varvec{\alpha }^N)-\inf _{\beta } J^{N}_{i}([\varvec{\alpha }^{N,-i},\beta ]) \\&\quad \le J^{N}_{i}(\varvec{\alpha }^N) -J(\alpha ^{*})+J(\alpha ^{*})- J^{N}_{i}([\varvec{\alpha }^{N,-i},\beta ^{N,i}]) + \varepsilon /2. \end{aligned} \end{aligned}$$
(6.14)

By Step 1 and Step 2 there exists \(N_0(\varepsilon )\) such that

$$\begin{aligned} J^{N}_i(\varvec{\alpha }^N) -J(\alpha ^{*})\le \varepsilon /4 \qquad \text {and} \qquad J(\alpha ^{*})- J^{N}_{i}([\varvec{\alpha }^{N,-i},\beta ^{N,i}])\le \varepsilon /4 \end{aligned}$$

for all \(N\ge N_0(\varepsilon )\). Plugging these bounds into Eq. (6.14) yields \(J^{N}_{i}(\varvec{\alpha }^N)-\inf _{\beta } J^{N}_{i}([\varvec{\alpha }^{N,-i},\beta ]) \le \varepsilon \) for all \(N\ge N_0(\varepsilon )\), i.e. \(\varvec{\alpha }^{N}\) is an \(\varepsilon \)-Nash equilibrium. This concludes the proof. \(\square \)