1 Introduction

Since the seminal contributions of Lasry and Lions [44] and Huang, Malhamé and Caines [39], mean field games have become an active field of mathematical research with a wide range of applications, including economics [13, 16, 27, 33, 41, 50], sociology [35], finance [17, 45], epidemiology [23, 26, 46] and computer science [40]; see also the overview article [29] and the monograph [9].

Mean field games constitute a class of dynamic, multi-player stochastic differential games with identical agents. The key characteristic of the mean field approach is that (i) the payoff and state dynamics of each agent depend on other agents’ decisions only through an aggregate statistic (typically, the aggregate distribution of states); and (ii) no individual agent’s actions can change the aggregate outcome. Thus, in solving an individual agent’s optimization problem, the feedback effect of his own actions on the aggregate outcome can be discarded, breaking the notorious vicious circle (“the optimal strategy depends on the aggregate outcome, which depends on the strategy, which depends ...”). This significantly facilitates the identification of rational expectations equilibria. A standard assumption that further simplifies the analysis is that randomness is idiosyncratic (equivalently, there is no common noise), i.e. that the random variables appearing in one agent’s optimization are independent of those in any other’s. As a result, all randomness is “averaged out” in the aggregation of individual decisions, and the equilibrium dynamics of the aggregate distribution are deterministic.

In the literature, mean field games are most often studied in settings with a continuous state space and deterministic or diffusive dynamics, i.e. stochastic differential equations (SDEs) driven by Brownian motion. The corresponding dynamic programming equations thus become parabolic partial differential equations, and the aggregate dynamics are represented by a flow of Borel probability measures; see, e.g., the monographs [4] and [9] and the references therein. Formally, the mean field game is typically formulated in terms of a controlled McKean-Vlasov SDE, where the coefficients depend on the current state and control and the distribution of the solution; intuitively, these McKean-Vlasov dynamics codify the dynamics that pertain to a representative agent. The mathematical link to N-player games is subsequently made through suitable propagation of chaos results in the mean field limit \(N\rightarrow \infty \); see, e.g., [14, 25, 28, 42, 43]. In this context, the analysis of McKean-Vlasov SDEs has also seen significant progress recently; see, e.g., [6, 8, 19, 48]. In the presence of common noise, i.e. sources of risk that affect all agents and do not average out in the mean field limit, the mathematical analysis becomes even more involved as the dynamics of the aggregate distribution become stochastic, leading to conditional McKean-Vlasov dynamics; see, e.g., [1, 12, 21, 51]. We refer to [10] for background and further references on continuous-state mean field games with common noise.

There is also a strand of literature on mean field games with finite state spaces, including [2, 15, 18, 24, 30, 31, 34, 49] as well as [9, §7.2]. In a recent article, [22] provide an extension of [31] to mean field interactions that occur not only through the agents’ states, but also through their controls. To the best of our knowledge, however, to date there has been no extension of these results to settings that include common noise. In the context of finite-state mean field games, we are only aware of two contributions that include common stochasticity (both via the master equation and with a different focus/setting than this paper): [5] analyze the master equation for finite-state mean field games with common noise, and [3] include a common continuous-time Gaussian noise in the aggregate distribution dynamics.

In this article, we set up a mathematical framework for finite-state mean field games with common noise. Our setup extends that of [31] and [15] by allowing for common noise events at fixed points in time. We provide a rigorous formulation of the underlying stochastic dynamics, and we establish a verification theorem for the optimal strategy and an aggregation theorem to determine the resulting aggregate distribution. This leads to a characterization of the mean field equilibrium in terms of a system of (random) forward-backward differential equations. The key insight is that, after conditioning on common noise configurations, we obtain classical piecewise dynamics subject to jump conditions at common noise times.

The remainder of this article is organized as follows: In Sect. 2 we set up the mathematical model, provide a probabilistic construction of the state dynamics, and formulate the agent’s optimization problem. In Sect. 3 we state the dynamic programming equation and establish a verification theorem for the agent’s optimization, given an ex ante aggregate distribution (Theorem 6). Section 4 provides the dynamics of the ex post distribution (Theorem 9) and, on that basis, a system of random forward-backward ODEs for the mean field equilibrium (Definition 10) as well as corresponding existence and uniqueness results (Theorems 13 and 16). In Sect. 5 we showcase our results in two benchmark applications: agricultural production and infection control. The Appendix provides the proofs of Theorems 13 and 16.

2 Mean Field Model

We first provide an informal description of the individual agents’ state dynamics, optimization problem, and the resulting mean field equilibrium. The agent’s state process \(X=\{X_t\}\) takes values in the finite set \({\mathbb {S}}\). Between common noise events, transitions from state i to state j occur with intensity \(Q^{ij}(t,W_t,M_t,\nu _t)\), where \(W_t\) represents the common noise events that have occurred up to time t; \(M_t\) the time-t aggregate distribution of agents; and \(\nu _t\) the agent’s control. In addition, upon the realization of a common noise event \(W_k\) at time \(T_k\), the state jumps from \(X_{T_k-}\) to \(X_{T_k}=J^{X_{T_k-}}(T_k,W_{T_k},M_{T_k-})\). With this, the agent aims to maximize

$$\begin{aligned} {\mathbb {E}}^\nu \Bigl [ \int _0^T \psi ^{X_t}(t,W_t,M_t,\nu _t)\mathrm {d}t + \Psi ^{X_T}(W_T,M_T) \Bigr ] \end{aligned}$$

where \(\psi \) and \(\Psi \) are suitable reward functions and the aggregate distribution process \(M=\{M_t\}\) is given by

$$\begin{aligned} M_t\triangleq \mu (t,W_t)\quad \text {for }t\in [0,T]. \end{aligned}$$

Here \(\mu \) represents the aggregate distribution of states as a function of the common noise factors. We obtain a rational expectations equilibrium by determining \(\mu \) such that the representative agent’s ex ante expectations equal the ex post aggregate distribution resulting from all agents’ optimal decisions, i.e.

$$\begin{aligned} {\mathbb {P}}^{{{\widehat{\nu }}}}(X_t\in \,\cdot \,\,|\,W_t) = {{\widehat{\mu }}}(t,W_t)\quad \text {for all }t\in [0,T], \end{aligned}$$

where \({{\widehat{\nu }}}\) and \({{\widehat{\mu }}}\) denote the equilibrium strategy and the equilibrium aggregate distribution. In the remainder of this section, we provide a rigorous mathematical formulation of this model.

2.1 Probabilistic Setting and Common Noise

Throughout, we fix a time horizon \(T>0\) and a finite set \({\mathbb {W}}\) and work on a probability space \((\Omega ,{\mathfrak {A}},{\mathbb {P}})\) that carries a finite sequence \(W_1,\ldots ,W_n\) of i.i.d. random variables that are uniformly distributed on \({\mathbb {W}}\). We refer to \(W_1,\dots ,W_n\) as common noise factors and to \({\mathbb {P}}\) as the reference probability. The common noise factor \(W_k\) is revealed at time \(T_k\), where

$$\begin{aligned} 0\triangleq T_0< T_1< T_2< \cdots< T_n < T_{n+1}\triangleq T. \end{aligned}$$

Both n and the common noise times \(T_0,T_1,\ldots ,T_{n+1}\) are fixed and deterministic. The piecewise constant filtration \({\mathfrak {G}}=\{{\mathfrak {G}}_t\}\) generated by common noise events is given by

$$\begin{aligned} {\mathfrak {G}}_t\triangleq \sigma \bigl ( W_k\, :\, k\in [1:n],\ T_k\le t \bigr )\vee {\mathfrak {N}}\quad \text {for }t\in [0,T] \end{aligned}$$

where \({\mathfrak {N}}\) denotes the set of \({\mathbb {P}}\)-null sets. For each configuration of common noise factors \(w\in {\mathbb {W}}^n\) we write

$$\begin{aligned} w_t\triangleq (w_1,\ldots ,w_k)\ \text {for }t\in [T_k,T_{k+1}\rangle ,\ k\in [0:n], \end{aligned}$$

where for \(0\le s\le t\le T\) we set \([s,t\rangle \triangleq [s,t)\) if \(t<T\) and \([s,T\rangle \triangleq [s,T]\). With this convention, \(W=\{W_t\}\) represents a piecewise constant, \({\mathfrak {G}}\)-adapted process.
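For illustration, suppose \(n=2\): then \(w_t\) is the empty vector for \(t\in [0,T_1)\), \(w_t=(w_1)\) for \(t\in [T_1,T_2)\) and \(w_t=(w_1,w_2)\) for \(t\in [T_2,T]\), so that \(W_t\) is constant on each of these three segments.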

Definition 1

A function \(f:\ [0,T]\times {\mathbb {W}}^n\rightarrow {\mathbb {R}}^m\) is non-anticipative if for all \(t\in [0,T]\)

$$\begin{aligned} f(t,w) = f(t,{{\bar{w}}})\quad \text {whenever }w,{{\bar{w}}}\in {\mathbb {W}}^n\ \text {are such that }w_t={{\bar{w}}}_t. \end{aligned}$$

Moreover, f is regular if \(f(\,\cdot \,,w)\) is absolutely continuous on \([T_k,T_{k+1}\rangle \) for all \(k\in [0:n]\). \(\square \)

With a slight abuse of notation, if \(f:\ [0,T]\times {\mathbb {W}}^n\rightarrow {\mathbb {R}}^m\) is non-anticipative, we write

$$\begin{aligned} f(t,w_t) \triangleq f(t,w)\quad \text {for }w\in {\mathbb {W}}^n,\ t\in [0,T]. \end{aligned}$$

Note that for f regular, the one-sided limits \(f(T_k-,w)\triangleq \lim _{t\uparrow T_k}f(t,w)\) exist for all \(k\in [1:n]\), \(w\in {\mathbb {W}}^n\).

2.2 Optimization Problem

The agent’s state and action spaces are given by

$$\begin{aligned} {\mathbb {S}}\triangleq [1:d]\qquad \text {and}\qquad {\mathbb {U}}\subseteq {\mathbb {R}}^k,\qquad \text {where }d,k\in {\mathbb {N}}\text { and }{\mathbb {U}}\ne \varnothing , \end{aligned}$$

and we identify the space of aggregate distributions on \({\mathbb {S}}\) with the space of probability vectors

$$\begin{aligned} {\mathbb {M}}\triangleq \Bigl \{ m\in [0,\infty )^{1\times d}\, :\, \sum _{i=1}^d m^i = 1 \Bigr \}. \end{aligned}$$

The coefficients in the state dynamics and payoff functional are bounded and Borel measurable functions

$$\begin{aligned} Q:\ [0,T]\times {\mathbb {W}}^n\times {\mathbb {M}}\times {\mathbb {U}}&\rightarrow {\mathbb {R}}^{d\times d}&J:\ [0,T]\times {\mathbb {W}}^n\times {\mathbb {M}}&\rightarrow {\mathbb {S}}^d\\ \psi :\ [0,T]\times {\mathbb {W}}^n\times {\mathbb {M}}\times {\mathbb {U}}&\rightarrow {\mathbb {R}}^d&\Psi :\ {\mathbb {W}}^n\times {\mathbb {M}}&\rightarrow {\mathbb {R}}^d \end{aligned}$$

such that \(Q(\,\cdot \,,\,\cdot \,,m,u)\), \(\psi (\,\cdot \,,\,\cdot \,,m,u)\) and \(J(\,\cdot \,,\,\cdot \,,m)\) are non-anticipative for all fixed \(m\in {\mathbb {M}}\) and \(u\in {\mathbb {U}}\); Q satisfies the intensity matrix conditions \(Q^{ij}(t,w,m,u)\ge 0\), \(i,j\in {\mathbb {S}}\), \(i\ne j\) and \(\sum _{j\in {\mathbb {S}}}Q^{ij}(t,w,m,u)=0\), \(i\in {\mathbb {S}}\), for \((t,w,m,u)\in [0,T]\times {\mathbb {W}}^n\times {\mathbb {M}}\times {\mathbb {U}}\); and for each \(k\in [1:n]\) the function

$$\begin{aligned} \kappa _k:\ {\mathbb {W}}^k\times {\mathbb {M}}\rightarrow [0,1],\qquad (w_k,w_1,\ldots ,w_{k-1},m)\mapsto \ \kappa _k(w_k|w_1,\ldots ,w_{k-1},m), \end{aligned}$$

is Borel measurable with \(\sum _{{{\bar{w}}}_k\in {\mathbb {W}}}\kappa _k({{\bar{w}}}_k|w_1,\ldots ,w_{k-1},m)=1\) for all \(w_1,\ldots ,w_{k-1}\in {\mathbb {W}}\) and \(m\in {\mathbb {M}}\).

We further suppose that \((\Omega ,{\mathfrak {A}},{\mathbb {P}})\) supports, for each \(i,j\in {\mathbb {S}}\), \(i\ne j\), a standard (i.e., unit intensity) Poisson process \(N^{ij}=\{N^{ij}_t\}\) and an \({\mathbb {S}}\)-valued random variable \(X_0\) such that

$$\begin{aligned} X_0\qquad \text {and}\qquad N^{ij},\ i,j\in {\mathbb {S}},\ i\ne j\qquad \text {and}\qquad W_1,\ldots ,W_n\qquad \text {are independent.} \end{aligned}$$

The corresponding full filtration \({\mathfrak {F}}=\{{\mathfrak {F}}_t\}\) is given by

$$\begin{aligned} {\mathfrak {F}}_t\triangleq \sigma \bigl ( X_0,\, W_s,\, N^{ij}_s\, :\, s\in [0,t];\ i,j\in {\mathbb {S}},\ i\ne j \bigr )\vee {\mathfrak {N}}\quad \text {for }t\in [0,T]. \end{aligned}$$

Note that \({\mathfrak {G}}_t\subseteq {\mathfrak {F}}_t\) for all \(t\in [0,T]\), that both \({\mathfrak {G}}\) and \({\mathfrak {F}}\) satisfy the usual conditions, and that \(N^{ij}\) is a standard \(({\mathfrak {F}},{\mathbb {P}})\)-Poisson process for \(i,j\in {\mathbb {S}}\), \(i\ne j\). Given a regular, non-anticipative function \(\mu \), the \({\mathfrak {G}}\)-adapted, \({\mathbb {M}}\)-valued ex ante aggregate distribution \(M=\{M_t\}\) is given by

$$\begin{aligned} M_t\triangleq \mu (t,W_t)\quad \text {for }t\in [0,T] \end{aligned}$$

and the agent’s optimization problem reads

$$\begin{aligned} \sup _{\nu \in {\mathcal {A}}}\ {\mathbb {E}}^\nu \Bigl [ \int _0^T \psi ^{X_t}(t,W_t,M_t,\nu _t)\mathrm {d}t + \Psi ^{X_T}(W_T,M_T) \Bigr ] \end{aligned}$$
(P\({}_\mu \))

where the class of admissible strategies for (P\({}_\mu \)) is given by the set of closed-loop controls

$$\begin{aligned} {\mathcal {A}}\triangleq \bigl \{ \nu :\ [0,T]\times {\mathbb {S}}^{[0,T]}\times {\mathbb {W}}^n\rightarrow {\mathbb {U}}\ :\,&\nu \ \text {is Borel measurable and }\\&\quad \nu (\,\cdot \,,x,\,\cdot \,)\ \text {is non-anticipative for all }x\in {\mathbb {S}}^{[0,T]} \bigr \}. \end{aligned}$$

Note that \({\mathcal {A}}\) subsumes the class of Markovian feedback controls considered in, e.g., [31] or [34], and that each \(\nu \in {\mathcal {A}}\) canonically induces an \({\mathfrak {F}}\)-adapted \({\mathbb {U}}\)-valued process via

$$\begin{aligned} \nu _t\triangleq \nu \bigl (t,X_{(\,\cdot \,\wedge t)-},W_t\bigr )\quad \text {for }t\in [0,T]. \end{aligned}$$

\({\mathbb {E}}^\nu [\,\cdot \,]\) denotes the expectation operator with respect to the probability measure \({\mathbb {P}}^\nu \) given by

$$\begin{aligned} \frac{\mathrm {d}{\mathbb {P}}^\nu }{\mathrm {d}{\mathbb {P}}}&= \prod \limits _{\begin{array}{c} i,j\in {\mathbb {S}},\\ i\ne j \end{array}}\left( \exp \left\{ \int _0^T \bigl (1-Q^{ij}(t,W_t,M_t,\nu _t)\bigr )\mathrm {d}t\right\} \cdot \, \prod \limits _{\begin{array}{c} t\in (0,T],\\ \Delta N^{ij}_t\ne 0 \end{array}} \, Q^{ij}(t,W_t,M_t,\nu _t)\right) \nonumber \\&\quad \times \ |{\mathbb {W}}|^n\cdot \prod _{k=1}^n\kappa _k\bigl (W_k|W_1,\ldots ,W_{k-1},M_{T_k-}\bigr ); \end{aligned}$$
(1)

and the agent’s state process X is given by

$$\begin{aligned} \mathrm {d}X_t = \sum \limits _{\begin{array}{c} i,j\in {\mathbb {S}},\\ i\ne j \end{array}} \mathbb {1}_{\{X_{t-}=i\}} (j-i) \mathrm {d}N^{ij}_t\quad \text {for }t\in [T_k,T_{k+1}\rangle ,\ k\in [0:n], \end{aligned}$$
(2)

subject to the jump conditions

$$\begin{aligned} X_{T_k}=J^{X_{T_k-}}\bigl (T_k,W_{T_k},M_{T_k-}\bigr )\ \text {for\ } k\in [1:n]. \end{aligned}$$
(3)

Here \(N^{ij}\) triggers transitions from state i to state j, and \({\mathbb {P}}^\nu \) is defined in such a way that \(N^{ij}\) has \({\mathbb {P}}^\nu \)-intensity \(Q^{ij}(t,W_t,M_t,\nu _t)\); see Lemma 2 below. In summary, in order to formulate a mean field model within the above setting, it suffices to specify the following (a toy instance is given after the list):

  • the agent’s state space \({\mathbb {S}}\), action space \({\mathbb {U}}\) and the common noise space \({\mathbb {W}}\),

  • the transition intensities \(Q(t,w,m,u)\), transition kernels \(\kappa _k(w_k|w_1,\ldots ,w_{k-1},m)\) and common noise jumps \(J(t,w,m)\), and finally

  • the reward functions \(\psi (t,w,m,u)\) and \(\Psi (w,m)\).
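For illustration (a toy specification chosen here purely as an example, not one of the benchmark applications of Sect. 5), one could take \(d=2\), \({\mathbb {U}}=[0,1]\), \({\mathbb {W}}=\{\uparrow ,\downarrow \}\) and \(n=1\), together with

$$\begin{aligned} Q(t,w,m,u)\triangleq \begin{pmatrix} -u & u \\ m^1 & -m^1 \end{pmatrix},\qquad \kappa _1(w_1|m)\triangleq \tfrac{1}{2},\qquad J(T_1,(\uparrow ),m)\triangleq (1,2),\qquad J(T_1,(\downarrow ),m)\triangleq (1,1), \end{aligned}$$

and the rewards \(\psi ^i(t,w,m,u)\triangleq -\tfrac{1}{2}u^2\) and \(\Psi (w,m)\triangleq (0,1)\): each agent controls the rate at which it leaves state 1, returns to state 1 at a rate equal to the fraction \(m^1\) of agents currently in state 1, and is reset to state 1 if the adverse common noise event \(\downarrow \) is realized at time \(T_1\).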

2.3 State Dynamics

In what follows, we show that the preceding construction implies the dynamics described informally above.

Lemma 2

(\({\mathbb {P}}^\nu \)-dynamics) For each admissible strategy \(\nu \in {\mathcal {A}}\), \({\mathbb {P}}^\nu \) is a well-defined probability measure on \((\Omega ,{\mathfrak {A}})\), absolutely continuous with respect to \({\mathbb {P}}\), and satisfies

$$\begin{aligned} {\mathbb {P}}^\nu = {\mathbb {P}}\quad \text {on }\sigma (X_0). \end{aligned}$$

Moreover, \(N^{ij}\) is a counting process with \(({\mathfrak {F}},{\mathbb {P}}^\nu )\)-intensity \(\lambda ^{ij}=\{\lambda ^{ij}_t\}\), where

$$\begin{aligned} \lambda ^{ij}_t \triangleq Q^{ij}\left( t,W_t,M_t,\nu _t\right) \quad \text {for }t\in [0,T]\ \text {and }i,j\in {\mathbb {S}},\ i\ne j. \end{aligned}$$

Finally, for all \(k\in [1:n]\) we have

$$\begin{aligned} {\mathbb {P}}^\nu \bigl (W_k=w_k|{\mathfrak {G}}_{T_k-}\bigr ) = \kappa _k(w_k|W_1,\ldots ,W_{k-1},M_{T_k-})\quad \text {for all }w_1,\ldots ,w_k\in {\mathbb {W}}\end{aligned}$$

where \({\mathfrak {G}}\) denotes the common noise filtration and, in particular,

$$\begin{aligned} {\mathbb {P}}^{\nu _1} = {\mathbb {P}}^{\nu _2}\quad \text {on }{\mathfrak {G}}_T\quad \text {for all admissible strategies }\nu _1,\nu _2\in {\mathcal {A}}. \end{aligned}$$

Proof

We fix \(\nu \in {\mathcal {A}}\) and split the proof into four steps.

Step 1: \({\mathbb {P}}^\nu \) is well-defined by (1). Since \(N^{ij}\) is a standard Poisson process under \({\mathbb {P}}\), the compensated process \({\bar{N}}^{ij}_t\triangleq N^{ij}_t-t\), \(t\ge 0\), is an \(({\mathfrak {F}},{\mathbb {P}})\)-martingale for all \(i,j\in {\mathbb {S}}\), \(i\ne j\). We define \(\theta ^\nu =\{\theta ^\nu _t\}\) via

$$\begin{aligned} \theta ^\nu _t \triangleq \sum \limits _{\begin{array}{c} i,j\in {\mathbb {S}},\\ i\ne j \end{array}}\int _0^t\bigl (Q^{ij}\bigl (s,W_s,\mu (s,W_s),\nu _s\bigr )-1\bigr )\mathrm {d}{\bar{N}}^{ij}_s,\quad t\in [0,T], \end{aligned}$$

and observe that the Doléans-Dade exponential \({\mathcal {E}}[\theta ^\nu ]\) is a local \(({\mathfrak {F}},{\mathbb {P}})\)-martingale with

$$\begin{aligned} {\mathcal {E}}[\theta ^\nu ]_t = \prod \limits _{\begin{array}{c} i,j\in {\mathbb {S}},\\ i\ne j \end{array}}\left( \exp \left\{ \int _0^t\bigl (1-Q^{ij}\bigl (s,W_s,\mu (s,W_s),\nu _s\bigr )\bigr )\mathrm {d}s\right\} \cdot \prod \limits _{\begin{array}{c} s\in (0,t],\\ \Delta N^{ij}_s\ne 0 \end{array}}Q^{ij}(s,W_s,\mu (s,W_s),\nu _s)\right) \end{aligned}$$
(4)

for \(t\in [0,T]\). Next, we define \(\vartheta =\{\vartheta _t\}\) via

$$\begin{aligned} \vartheta _t\triangleq \sum _{\begin{array}{c} k\in [1:n],\\ T_k\le t \end{array}}\Bigl (|{\mathbb {W}}|\cdot \kappa _k\bigl (W_k|W_1,\ldots ,W_{k-1},\mu (T_k-,W_{T_k-})\bigr )-1\Bigr ),\quad t\in [0,T], \end{aligned}$$

and note that \(\vartheta \) is an \(({\mathfrak {F}},{\mathbb {P}})\)-martingale. Indeed, for each \(k\in [0:n]\) we have \(\vartheta _t=\vartheta _{T_k}\) for \(t\in [T_k,T_{k+1}\rangle \) and, using that \(W_k\) is independent of \({\mathfrak {F}}_{T_k-}\) and uniformly distributed on \({\mathbb {W}}\) under \({\mathbb {P}}\), it follows that

$$\begin{aligned} {\mathbb {E}}\bigl [\vartheta _{T_k}|{\mathfrak {F}}_{T_k-}\bigr ]&= \vartheta _{T_k-}+{\mathbb {E}}\bigl [|{\mathbb {W}}|\cdot \kappa _k\bigl (W_k|W_1,\ldots ,W_{k-1},\mu (T_k-,W_{T_k-})\bigr )-1\big |{\mathfrak {F}}_{T_k-}\bigr ]\\&= \vartheta _{T_k-} - 1 + |{\mathbb {W}}|\cdot \sum _{w_k\in {\mathbb {W}}}{\mathbb {P}}\bigl (W_k=w_k|W_1,\ldots ,W_{k-1},\mu (T_k-,W_{T_k-})\bigr )\\&\qquad \qquad \qquad \qquad \qquad \qquad \qquad \qquad \times \kappa _k\bigl (w_k|W_1,\ldots ,W_{k-1},\mu (T_k-,W_{T_k-})\bigr )\\&= \vartheta _{T_k-} - 1 + |{\mathbb {W}}|\cdot \sum _{w_k\in {\mathbb {W}}}\frac{1}{|{\mathbb {W}}|}\cdot \kappa _k\bigl (w_k|W_1,\ldots ,W_{k-1},\mu (T_k-,W_{T_k-})\bigr ) = \vartheta _{T_k-}. \end{aligned}$$

Hence the Doléans-Dade exponential \({\mathcal {E}}[\vartheta ]\) is a local \(({\mathfrak {F}},{\mathbb {P}})\)-martingale, and we have

$$\begin{aligned} {\mathcal {E}}[\vartheta ]_t = \prod _{s\in (0,t]}(1+\Delta \vartheta _s) = \prod _{\begin{array}{c} k\in [1:n],\\ T_k\le t \end{array}} \Bigl (|{\mathbb {W}}|\cdot \kappa _k\bigl (W_k|W_1,\ldots ,W_{k-1},\mu (T_k-,W_{T_k-})\bigr )\Bigr )\nonumber \\ \end{aligned}$$
(5)

for \(t\in [0,T]\). Since \(\Delta N^{ij}_{T_k}=0\) for all \(i,j\in {\mathbb {S}}\), \(i\not =j\), and \(k\in [1:n]\) a.s., we have \([\theta ^\nu ,\vartheta ]=0\), and thus the process \(Z^\nu \triangleq {\mathcal {E}}[\theta ^\nu +\vartheta ]={\mathcal {E}}[\theta ^\nu ]\cdot {\mathcal {E}}[\vartheta ]\), i.e.

$$\begin{aligned} Z^\nu _t&= \prod \limits _{\begin{array}{c} i,j\in {\mathbb {S}},\\ i\ne j \end{array}}\left( \exp \left\{ \int _0^t\bigl (1-Q^{ij}\bigl (s,W_s,\mu (s,W_s),\nu _s\bigr )\bigr )\mathrm {d}s\right\} \cdot \prod \limits _{\begin{array}{c} s\in (0,t],\\ \Delta N^{ij}_s\ne 0 \end{array}}Q^{ij}(s,W_s,\mu (s,W_s),\nu _s)\right) \nonumber \\&\quad \times \prod _{\begin{array}{c} k\in [1:n],\\ T_k\le t \end{array}} \Bigl (|{\mathbb {W}}|\cdot \kappa _k\bigl (W_k|W_1,\ldots ,W_{k-1},\mu (T_k-,W_{T_k-})\bigr )\Bigr ) \end{aligned}$$
(6)

is a local \(({\mathfrak {F}},{\mathbb {P}})\)-martingale. Since

$$\begin{aligned} \sup _{t\in [0,T]}|{\mathcal {E}}[\theta ^\nu ]_t| \le \mathrm {e}^{d^2 T}\cdot \ell ^{Y} \end{aligned}$$
(7)

where \(\ell \triangleq \max _{i,j\in {\mathbb {S}},\,i\ne j}\Vert Q^{ij}\Vert _\infty \) and \(Y\triangleq \sum _{i,j\in {\mathbb {S}},\,i\ne j}N_T^{ij}\sim _{{\mathbb {P}}}\mathsf {Poisson}(d(d-1)T)\) and

$$\begin{aligned} \sup _{t\in [0,T]}|{\mathcal {E}}[\vartheta ]_t| \le |{\mathbb {W}}|^n \end{aligned}$$
(8)

it follows that \(\sup _{t\in [0,T]}|Z^\nu _t|\) is \({\mathbb {P}}\)-integrable, so \(Z^\nu \) is in fact an \(({\mathfrak {F}},{\mathbb {P}})\)-martingale. Since \(Z^\nu \) is non-negative with \(Z^\nu _0=1\) by construction, we conclude that \({\mathbb {P}}^\nu \) is a well-defined probability measure on \({\mathfrak {A}}\), absolutely continuous with respect to \({\mathbb {P}}\), with density process

$$\begin{aligned} \left. \frac{\mathrm {d}{\mathbb {P}}^\nu }{\mathrm {d}{\mathbb {P}}}\right| _{{\mathfrak {F}}_t} = Z^\nu _t,\quad t\in [0,T]. \end{aligned}$$

Step 2: \({\mathbb {P}}^{\nu }\)-intensity of \(N^{ij}\). Let \(i,j\in {\mathbb {S}}\) with \(i\ne j\). Since \({\mathbb {P}}^\nu \ll {\mathbb {P}}\) it is clear that \(N^{ij}\) is a \({\mathbb {P}}^\nu \)-counting process, so it suffices to show that the process given by

$$\begin{aligned} N^{ij}_t - \int _0^t \lambda ^{ij}_s\,\mathrm {d}s,\quad t\in [0,T], \end{aligned}$$
(9)

is a local \(({\mathfrak {F}},{\mathbb {P}}^\nu )\)-martingale. To show this, by Step 1 it suffices to demonstrate that \(Z^\nu \cdot \bigl (N^{ij}-\int _0^{\,\cdot \,}\lambda ^{ij}_s\mathrm {d}s\bigr )\) is a local \(({\mathfrak {F}},{\mathbb {P}})\)-martingale. Noting that

  • \([N^{k\ell },N^{ij}]=\sum \limits _{s\in (0,\,\cdot \,]}\Delta N^{k\ell }_s\cdot \Delta N^{ij}_s=0\) whenever \(k,\ell \in {\mathbb {S}}\) and \((k,\ell )\ne (i,j)\),

  • \(\mathrm {d}Z^\nu _t=Z^\nu _{t-}\mathrm {d}\theta ^\nu _t + Z^\nu _{t-}\mathrm {d}\vartheta _t = \sum \limits _{\begin{array}{c} k,\ell \in {\mathbb {S}}, k\ne \ell \end{array}}Z^\nu _{t-}\left( Q^{k\ell }(t,W_t,\mu (t,W_t),\nu _t)-1\right) \mathrm {d}{\bar{N}}^{k\ell }_t + Z^\nu _{t-}\mathrm {d}\vartheta _t\),

  • \(\mathrm {d}\bigl (N^{ij}_t-\int _0^t\lambda ^{ij}_s\mathrm {d}s\bigr ) = \mathrm {d}{\bar{N}}^{ij}_t + \bigl (1-\lambda ^{ij}_t\bigr )\mathrm {d}t\) with \(\lambda ^{ij}_t=Q^{ij}\bigl (t,W_t,\mu (t,W_t),\nu _t\bigr )\),

and using integration by parts, the local martingale property follows since

$$\begin{aligned} \mathrm {d}\Bigl (Z^\nu _t\Bigl (N^{ij}_t-\int _0^t\lambda ^{ij}_s\mathrm {d}s\Bigr )\Bigr )&= Z^\nu _{t-}\,\mathrm {d}{\bar{N}}^{ij}_t + Z^\nu _{t-}\bigl (1-\lambda ^{ij}_t\bigr )\mathrm {d}t + \Bigl (N^{ij}_{t-}-\int _0^t\lambda ^{ij}_s\mathrm {d}s\Bigr )\mathrm {d}Z^\nu _t + Z^\nu _{t-}\bigl (\lambda ^{ij}_t-1\bigr )\mathrm {d}N^{ij}_t\\&= Z^\nu _{t-}\,\lambda ^{ij}_t\,\mathrm {d}{\bar{N}}^{ij}_t + \Bigl (N^{ij}_{t-}-\int _0^t\lambda ^{ij}_s\mathrm {d}s\Bigr )\mathrm {d}Z^\nu _t. \end{aligned}$$

Step 3: \({\mathbb {P}}^\nu ={\mathbb {P}}\) on \(\sigma (X_0)\). For any function \(g:\ {\mathbb {S}}\rightarrow {\mathbb {R}}\) we have

$$\begin{aligned} {\mathbb {E}}^\nu [g(X_0)] = {\mathbb {E}}[g(X_0)\cdot Z_T^\nu ] = {\mathbb {E}}\bigl [g(X_0)\cdot {\mathbb {E}}[Z_T^\nu |{\mathfrak {F}}_0]\bigr ] = {\mathbb {E}}[g(X_0)\cdot Z^\nu _0] = {\mathbb {E}}[g(X_0)] \end{aligned}$$

by the \(({\mathfrak {F}},{\mathbb {P}})\)-martingale property of \(Z^\nu \).

Step 4: Distribution of \(W_k\) under \({\mathbb {P}}^\nu \). Let \(k\in [1:n]\) and \(w_1,\ldots ,w_k\in {\mathbb {W}}\). Since \({\mathcal {E}}[\theta ^\nu ]_{T_k}={\mathcal {E}}[\theta ^\nu ]_{T_k-}\) a.s. and \(W_k\) is uniformly distributed on \({\mathbb {W}}\) and independent of \({\mathfrak {F}}_{T_k-}\) under \({\mathbb {P}}\), iterated conditioning yields

$$\begin{aligned}&{\mathbb {P}}^\nu \bigl (W_1=w_1,\ldots ,W_k=w_k\bigr ) = {\mathbb {E}}\bigl [Z^\nu _{T_k} \cdot \mathbb {1}_{\{W_k=w_k\}} \cdot \mathbb {1}_{\{W_1=w_1,\ldots ,W_{k-1}=w_{k-1}\}}\bigr ]\\&= {\mathbb {E}}\bigl [ Z^\nu _{T_k-} \cdot |{\mathbb {W}}| \cdot \kappa _k(W_k|W_1,\ldots ,W_{k-1},\mu (T_k-,W_{T_k-})) \cdot \mathbb {1}_{\{W_k=w_k\}} \cdot \mathbb {1}_{\{W_1=w_1,\ldots ,W_{k-1}=w_{k-1}\}}\bigr ]\\&= |{\mathbb {W}}| \cdot \kappa _k(w_k|w_1,\ldots ,w_{k-1},\mu (T_k-,w_{T_k-})) \cdot {\mathbb {E}}\bigl [ Z^\nu _{T_k-} \cdot \mathbb {1}_{\{W_1=w_1,\ldots ,W_{k-1}=w_{k-1}\}} \\&\qquad \qquad \qquad \qquad \qquad \qquad \qquad \qquad \qquad \qquad \qquad \qquad \qquad \cdot {\mathbb {P}}(W_k=w_k|{\mathfrak {F}}_{T_k-})\bigr ]\\&= \kappa _k(w_k|w_1,\ldots ,w_{k-1},\mu (T_k-,w_{T_k-})) \cdot {\mathbb {P}}^\nu \bigl (W_1=w_1,\ldots ,W_{k-1}=w_{k-1}\bigr ). \end{aligned}$$

Thus we have \({\mathbb {P}}^\nu (W_k=w_k|{\mathfrak {G}}_{T_k-}) = \kappa _k(w_k|W_1,\ldots ,W_{k-1},M_{T_k-})\) and the proof is complete. \(\square \)

Lemma 2 implies in particular that \({\mathbb {P}}^\nu (\Delta N^{ij}_t\ne 0)=0\) for every \(t\in [0,T]\), so as a consequence we have

$$\begin{aligned} \Delta X_t=0\quad {\mathbb {P}}^\nu \text {-a.s.\ for all }t\in [0,T]\setminus \{T_1,\ldots ,T_n\}. \end{aligned}$$

Moreover, since \({\mathbb {P}}^{\nu _1}={\mathbb {P}}^{\nu _2}\) on \({\mathfrak {G}}_T\) for all admissible controls \(\nu _1,\nu _2\in {\mathcal {A}}\) and \(M_t=\mu (t,W_t)\) for \(t\in [0,T]\), the agent’s ex ante beliefs concerning the common noise factors are the same, irrespective of his control.

3 Solution of the Optimization Problem

In the following, we solve the agent’s maximization problem (P\({}_\mu \)) using the associated dynamic programming equation (DPE). This is the same methodology as in [31] and [15]; see [22] for an alternative approach (to extended mean field games, but without common noise) based on backward SDEs.

The DPE for the value function of the agent’s optimization problem (P\({}_\mu \)) reads

$$\begin{aligned} 0 = \sup \limits _{u\in {\mathbb {U}}}\, \biggl \{ \frac{\partial v^i}{\partial t}(t,w) + \psi ^i\bigl (t,w,\mu (t,w),u\bigr ) + Q^{i\cdot }\bigl (t,w,\mu (t,w),u\bigr )\cdot v(t,w) \biggr \} \end{aligned}$$

for \(i\in {\mathbb {S}}\), subject to suitable consistency conditions for \(t=T_k\), \(k\in [1:n]\), and the terminal condition

$$\begin{aligned} v(T,w) = \Psi \bigl (w,\mu (T,w)\bigr )\quad \text {for all }w\in {\mathbb {W}}^n. \end{aligned}$$

Assumption 3

There exists a Borel measurable function \(h:\ [0,T]\times {\mathbb {W}}^n\times {\mathbb {M}}\times {\mathbb {R}}^d\rightarrow {\mathbb {U}}^d\) such that for every \(i\in {\mathbb {S}}\) and all \((t,w,m,v)\in [0,T]\times {\mathbb {W}}^n\times {\mathbb {M}}\times {\mathbb {R}}^d\) we have

$$\begin{aligned} h^i(t,w,m,v) \in \mathop {{{\,\mathrm{arg\,max}\,}}}\limits _{u\in {\mathbb {U}}} \bigl \{ \psi ^i(t,w,m,u) + Q^{i\cdot }(t,w,m,u)\cdot v \bigr \}. \end{aligned}$$
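For instance, in a two-state model with \({\mathbb {U}}=[0,{\bar{u}}]\) for some \({\bar{u}}>0\), \(\psi ^1(t,w,m,u)=-\tfrac{1}{2}u^2\) and \(Q^{12}(t,w,m,u)=u=-Q^{11}(t,w,m,u)\), the expression to be maximized in state 1 reads

$$\begin{aligned} \psi ^1(t,w,m,u) + Q^{1\cdot }(t,w,m,u)\cdot v = -\tfrac{1}{2}u^2 + u\bigl (v^2-v^1\bigr ), \end{aligned}$$

so that \(h^1(t,w,m,v)=\min \bigl \{{\bar{u}},\max \{0,v^2-v^1\}\bigr \}\) is an explicit Borel measurable maximizer.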

Assumption 3 is satisfied e.g. if \({\mathbb {U}}\) is compact and Q and \(\psi \) are continuous with respect to \(u\in {\mathbb {U}}\). Note that, since \(\psi ^i(\,\cdot \,,\,\cdot \,,m,u)\) and \(Q^{i\cdot }(\,\cdot \,,\,\cdot \,,m,u)\) are non-anticipative for \(m\in {\mathbb {M}}\), \(u\in {\mathbb {U}}\), we can assume without loss of generality that \(h(\,\cdot \,,\,\cdot \,,m,v)\) is non-anticipative for \(m\in {\mathbb {M}}\), \(v\in {\mathbb {R}}^d\). With this, we define

$$\begin{aligned}&{\widehat{Q}}:\ [0,T]\times {\mathbb {W}}^n\times {\mathbb {M}}\times {\mathbb {R}}^d\rightarrow {\mathbb {R}}^{d\times d}, \quad&{\widehat{Q}}^{ij}(t,w,m,v)\triangleq Q^{ij}\bigl ( t, w, m, h^i(t,w,m,v) \bigr ),\\&{\widehat{\psi }}:\ [0,T]\times {\mathbb {W}}^n\times {\mathbb {M}}\times {\mathbb {R}}^d\rightarrow {\mathbb {R}}^d, \quad&{\widehat{\psi }}^i(t,w,m,v)\triangleq \psi ^i\bigl ( t, w, m, h^i(t,w,m,v) \bigr ) \end{aligned}$$

and thus obtain the following reduced-form DPE, which we use in what follows:

Definition 4

Let \(\mu :\ [0,T]\times {\mathbb {W}}^n\rightarrow {\mathbb {M}}\) be regular and non-anticipative. A function \(v:[0,T]\times {\mathbb {W}}^n\rightarrow {\mathbb {R}}^d\) is called a solution of (DP\({}_\mu \)) subject to (CC\({}_\mu \)), (TC\({}_\mu \)) if v is non-anticipative and satisfies the ordinary differential equation (ODE)

$$\begin{aligned} 0 = {\dot{v}}^i(t,w) + {\widehat{\psi }}^i\bigl (t,w,\mu (t,w),v(t,w)\bigr ) + {\widehat{Q}}^{i\cdot }\bigl (t,w,\mu (t,w),v(t,w)\bigr )\cdot v(t,w),\quad i\in {\mathbb {S}}, \end{aligned}$$
(DP\({}_\mu \))

for \(t\in [T_k,T_{k+1}\rangle \), \(k\in [0:n]\), subject to the consistency and terminal conditions

$$\begin{aligned} v(T_k-,w) = \Psi _k\bigl (w,\mu (T_k-,w),v(T_k,\,\cdot \,)\bigr ) \end{aligned}$$
(CC\({}_\mu \))
$$\begin{aligned} v(T,w) = \Psi \bigl (w,\mu (T,w)\bigr ) \end{aligned}$$
(TC\({}_\mu \))

for \(k\in [1:n]\) and all \(w\in {\mathbb {W}}^n\). Here, for \(k\in [1:n]\), the jump operator \(\Psi _k\) is defined via

$$\begin{aligned} \Psi ^i_k(w,m,{{\bar{v}}}) \triangleq \sum _{{{\bar{w}}}_k\in {\mathbb {W}}} \kappa _k\bigl ({{\bar{w}}}_k|w_1,\ldots ,w_{k-1},m\bigr ) \cdot {{\bar{v}}}^{J^i(T_k,(w_{-k},{{\bar{w}}}_k),m)}(w_{-k},{{\bar{w}}}_k),\ i\in {\mathbb {S}},\nonumber \\ \end{aligned}$$
(10)

where \({{\bar{v}}}:\, {\mathbb {W}}^n\rightarrow {\mathbb {R}}^d\) and \((w_{-k},{{\bar{w}}}_k)\triangleq (w_1,\ldots ,w_{k-1},{{\bar{w}}}_k,w_{k+1},\ldots ,w_n)\) for \({{\bar{w}}}_k\in {\mathbb {W}}\), \(w\in {\mathbb {W}}^n\). \(\square \)
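For example, in the simplest case \(n=1\) with \({\mathbb {W}}=\{\uparrow ,\downarrow \}\), the jump operator (10) reduces to

$$\begin{aligned} \Psi ^i_1(w,m,{{\bar{v}}}) = \kappa _1(\uparrow |m)\cdot {{\bar{v}}}^{J^i(T_1,(\uparrow ),m)}(\uparrow ) + \kappa _1(\downarrow |m)\cdot {{\bar{v}}}^{J^i(T_1,(\downarrow ),m)}(\downarrow ), \end{aligned}$$

i.e. the \(\kappa _1\)-weighted average of the post-jump values over the two possible common noise scenarios.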

Observe that (DP\({}_\mu \)) represents a system of (random) ODEs, coupled via \(w\in {\mathbb {W}}^n\). The ODEs run backward in time on each segment \([T_{k},T_{k+1}\rangle \times {\mathbb {W}}^n\), \(k\in [0:n]\), and their terminal conditions for \(t\uparrow T_{k+1}\) are specified by (TC\({}_\mu \)) for \(k=n\) and by (CC\({}_\mu \)) for \(k<n\). Note that for \(t\in [T_k,T_{k+1}\rangle \) the relevant common noise factors \(W_1,\ldots ,W_k\) are known.

Remark 5

While the significance of the DPE (DP\({}_\mu \)) and the terminal condition (TC\({}_\mu \)) is clear, the consistency conditions (CC\({}_\mu \)) warrant a brief comment: For \(i\in {\mathbb {S}}\), \(k\in [1:n]\) and \(w\in {\mathbb {W}}^n\) the state process jumps from state i to state \(j\triangleq J^i(T_k,(w_{-k},W_k),\mu (T_k-,w_{T_k-}))\) on \(\{X_{T_k-}=i\}\cap \{W_{T_k-}=w_{T_k-}\}\) when the common noise factor \(W_k\) is revealed at time \(T_k\). Averaging the post-jump values over the possible realizations of \(W_k\) with the weights \(\kappa _k\), as in (10), thus yields the pre-jump value \(v^i(T_k-,w)\) prescribed by (CC\({}_\mu \)).\(\square \)

We next link the solution of the DPE to the underlying stochastic control problem.

Theorem 6

(Verification) Suppose \(\mu :\ [0,T]\times {\mathbb {W}}^n\rightarrow {\mathbb {M}}\) is regular and non-anticipative and v is a solution of (DP\({}_\mu \)) subject to (CC\({}_\mu \)) and (TC\({}_\mu \)). Then v is the agent’s value function for problem (P\({}_\mu \)), i.e.

$$\begin{aligned} \sum _{i\in {\mathbb {S}}} {\mathbb {P}}(X_0=i)v^i(0) = \sup _{\nu \in {\mathcal {A}}}{\mathbb {E}}^\nu \Bigl [ \int _0^T \psi ^{X_t}(t,W_t,M_t,\nu _t)\mathrm {d}t + \Psi ^{X_T}(W_T,M_T) \Bigr ], \end{aligned}$$

and an optimal control is given by \({\widehat{\nu }}\in {\mathcal {A}}\) with

$$\begin{aligned} {\widehat{\nu }}\left( t,X_{(\,\cdot \,\wedge t)-},W_t\right) = h^{X_{t-}}\bigl (t,W_t,\mu (t,W_t),v(t,W_t)\bigr )\quad \text {for }t\in [0,T]. \end{aligned}$$

Proof

Let \(\nu \in {\mathcal {A}}\) be an admissible strategy. Until further notice we fix \(k\in [0:n]\).

Step 1: Dynamics on \([T_k,T_{k+1}\rangle \). From Itô’s lemma, applicable due to regularity of v, we obtain

$$\begin{aligned} v^{X_{T_{k+1}-}}\bigl (T_{k+1}-,W_{T_{k+1}-}\bigr )&= v^{X_{T_k}}\bigl (T_k,W_{T_k}\bigr ) + \sum \limits _{\begin{array}{c} i,j\in {\mathbb {S}},\\ i\ne j \end{array}}\bigl (M^{ij}_{T_{k+1}-}-M^{ij}_{T_k}\bigr )\nonumber \\&\quad +\int _{T_k}^{T_{k+1}}\sum _{i=1}^d\mathbb {1}_{\left\{ X_s=i\right\} }\left( {\dot{v}}^i(s,W_s)+Q^{i\cdot }\left( s,W_s,\mu (s,W_s),\nu _s\right) \cdot v(s,W_s)\right) \mathrm {d}s. \end{aligned}$$
(11)

Step 2: Jump dynamics at \(T_k\). We recall from Lemma 2 that

$$\begin{aligned} {\mathbb {P}}^\nu (W_k={{\bar{w}}}_k|X_{T_k-},W_1,\ldots ,W_{k-1})&= {\mathbb {P}}^\nu (W_k={{\bar{w}}}_k|W_1,\ldots ,W_{k-1})\\&= \kappa _k\bigl ({{\bar{w}}}_k|W_1,\ldots ,W_{k-1},\mu (T_k-,W_{T_k-})\bigr ). \end{aligned}$$

In view of the jump dynamics (3) and the consistency condition (CC\({}_\mu \)), we thus obtain

$$\begin{aligned}&{\mathbb {E}}^{\nu }\bigl [v^{X_{T_k}}\bigl (T_k,W_{T_k}\bigr )\big |\sigma \bigl (X_{T_k-},W_{T_k-}\bigr )\bigr ]\nonumber \\&\quad = {\mathbb {E}}^{\nu }\bigl [v^{J^{X_{T_k-}}(T_k,(W_{T_k-},W_k),\mu (T_k-,W_{T_k-}))}\bigl (T_k,(W_{T_k-},W_k)\bigr )\big |X_{T_k-},W_{T_k-}\bigr ]\nonumber \\&\quad = \sum _{{{\bar{w}}}_k\in {\mathbb {W}}} \kappa _k\bigl ({{\bar{w}}}_k|W_{T_k-},\mu (T_k-,W_{T_k-})\bigr )v^{J^{X_{T_k-}}(T_k,(W_{T_k-},{{\bar{w}}}_k),\mu (T_k-,W_{T_k-}))}\bigl (T_k,(W_{T_k-},{{\bar{w}}}_k)\bigr )\nonumber \\&\quad = \Psi ^{X_{T_k-}}_k\bigl (W_{T_k-}, \mu (T_k-,W_{T_k-}), v(T_k,\,\cdot \,)\bigr ) = v^{X_{T_k-}}(T_k-,W_{T_k-}). \end{aligned}$$
(12)

Step 3: Optimality. Combining (11) and (12) for \(k\in [1:n]\) and using (TC\({}_\mu \)) yields

(13)

where for \(i,j\in {\mathbb {S}}\), \(i\ne j\) the local \(({\mathfrak {F}},{\mathbb {P}}^\nu )\)-martingale \(M^{ij}\) is given by

$$\begin{aligned} M^{ij}_t \triangleq \int _0^t \mathbb {1}_{\{X_{s-}=i\}}\bigl (v^j(s,W_s)-v^i(s,W_s)\bigr )\bigl (\mathrm {d}N^{ij}_s-Q^{ij}\bigl (s,W_s,\mu (s,W_s),\nu _s\bigr )\mathrm {d}s\bigr ),\quad t\in [0,T]. \end{aligned}$$

Since \(N^{ij}-\int _0^{\,\cdot \,}Q^{ij}\bigl (s,W_s,\mu (s,W_s),\nu _s\bigr )\mathrm {d}s\) is a compensated counting process and v and Q are bounded, \(M^{ij}\) is in fact an \(({\mathfrak {F}},{\mathbb {P}}^\nu )\)-martingale. Hence taking \({\mathbb {P}}^\nu \)-expectations in (13), using the tower property of conditional expectation and the fact that \({\mathbb {P}}^{\nu }\) and \({\mathbb {P}}\) coincide on \(\sigma (X_0)\) by Lemma 2, and finally that v solves the DPE, we obtain

$$\begin{aligned} \sum _{i\in {\mathbb {S}}} {\mathbb {P}}(X_0=i)v^i(0)&= {\mathbb {E}}\bigl [v^{X_0}(0)\bigr ] = {\mathbb {E}}^{\nu }\bigl [v^{X_0}(0)\bigr ]\nonumber \\&= {\mathbb {E}}^\nu \bigg [ \Psi ^{X_T}\left( W_T,\mu \left( T,W_T\right) \right) \nonumber \\&\quad -\int _0^{T}\sum _{i=1}^d\mathbb {1}_{\left\{ X_s=i\right\} }\left( {\dot{v}}^i(s,W_s)+Q^{i\cdot }\left( s,W_s,\mu (s,W_s),\nu _s\right) \cdot v(s,W_s)\right) \mathrm {d}s\bigg ]\nonumber \\&\ge {\mathbb {E}}^\nu \bigg [\Psi ^{X_T}\left( W_T,M_T\right) +\int _0^T\psi ^{X_s}\left( s,W_s,M_s,\nu _s\right) \mathrm {d}s\bigg ]. \end{aligned}$$
(14)

If we replace \(\nu \) with \({\widehat{\nu }}\), the same argument applies with equality in (14); we thus conclude that v is the value function of (P\({}_\mu \)), and that the strategy \({\widehat{\nu }}\) is optimal. \(\square \)

The optimal strategy is Markovian in the agent’s state; this is unsurprising given the literature, see e.g. [31, Theorem 1] or [22, Proposition 3.9] and [15, Theorem 4]. Note, however, that the time-t optimal strategy may depend on all common noise events that have occurred up to time t, as \(W_t=(W_1,\ldots ,W_k)\) for \(t\in [T_k,T_{k+1}\rangle \). In the following, we denote by \(\widehat{{\mathbb {P}}}\) the probability measure

$$\begin{aligned} \widehat{{\mathbb {P}}}\triangleq {\mathbb {P}}^{{\widehat{\nu }}} \end{aligned}$$

where \({\widehat{\nu }}\) is the optimal control specified in Theorem 6. It follows from Lemma 2 that \(N^{ij}\) has \(\widehat{{\mathbb {P}}}\)-intensity \({\widehat{\lambda }}^{ij}=\{{\widehat{\lambda }}^{ij}_t\}\) for \(i,j\in {\mathbb {S}}\), \(i\ne j\), where

$$\begin{aligned} {\widehat{\lambda }}^{ij}_t\triangleq Q^{ij}\bigl (t,W_t,\mu (t,W_t),h^{X_{t-}}(t,W_t,\mu (t,W_t),v(t,W_t))\bigr )\quad \text {for }t\in [0,T]. \end{aligned}$$
(15)

4 Equilibrium

Having solved the agent’s optimization problem for a given ex ante function \(\mu \), we now turn to the resulting mean field equilibrium. We first identify the aggregate distribution resulting from the optimal control.

Remark 7

This paper generally adopts a “representative agent” point of view; an alternative justification of mean field equilibrium is via convergence of Nash equilibria of symmetric N-player games in the limit \(N\rightarrow \infty \); see, among others, [2, 14, 15, 18, 20, 22, 24, 28]. In the setting of this article (albeit under additional regularity conditions) a mean field limit justification can be provided along the lines of the proof of Theorem 7 in [31] by conditioning on common noise configurations, similarly to the proof of Theorem 9 below.\(\square \)

4.1 Aggregation

Given an ex ante aggregate distribution specified in terms of a regular, non-anticipative function \(\mu \) and a corresponding solution v of (DP\({}_\mu \)) subject to (CC\({}_\mu \)), (TC\({}_\mu \)), Theorem 6 yields an optimal strategy \({\widehat{\nu }}\) for the agent’s optimization problem (P\({}_\mu \)). With \(\widehat{{\mathbb {P}}}\) denoting the probability measure associated with \({\widehat{\nu }}\), the resulting ex post aggregate distribution is given by the \({\mathbb {M}}\)-valued, \({\mathfrak {G}}\)-adapted process \({\widehat{M}}=\{{\widehat{M}}_t\}\),

$$\begin{aligned} {\widehat{M}}_t\triangleq \widehat{{\mathbb {P}}}(X_t\in \,\cdot \,\,|\,{\mathfrak {G}}_t)\quad \text {for }t\in [0,T] \end{aligned}$$

where \({\mathfrak {G}}\) denotes the common noise filtration. We note that \({\widehat{M}}\) is càdlàg since \({\mathfrak {G}}\) is piecewise constant and X is càdlàg. Equilibrium obtains if \({\widehat{M}}_t=\mu (t,W_t)\) for all \(t\in [0,T]\). To proceed, we aim for a more explicit description of \({\widehat{M}}\) and, in particular, its dynamics. Thus we define for \(k\in [1:n]\)

$$\begin{aligned} \Phi _k:\ {\mathbb {W}}^n\times {\mathbb {M}}\times {\mathbb {M}}\rightarrow {\mathbb {M}},\quad \Phi _k(w,m,{{\bar{m}}})\triangleq m\cdot P_k(w,{{\bar{m}}}), \end{aligned}$$
(16)

where \(P_k:\ {\mathbb {W}}^n\times {\mathbb {M}}\rightarrow \{0,1\}^{d\times d}\) is given by

$$\begin{aligned} P_k^{ij}(w,{{\bar{m}}}) \triangleq \mathbb {1}_{\left\{ J^i(T_k,w_1,\ldots ,w_{k},{{\bar{m}}})=j\right\} }\quad \text {for }i,j\in {\mathbb {S}}\end{aligned}$$

and we set

$$\begin{aligned} m_0\triangleq {\mathbb {P}}(X_0\in \,\cdot \,)=\widehat{{\mathbb {P}}}(X_0\in \,\cdot \,)\in {\mathbb {M}}. \end{aligned}$$
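To illustrate the aggregate jump, suppose \(d=3\) and \(J^{\cdot }(T_k,w_{T_k},{{\bar{m}}})=(1,1,3)\), i.e. the common noise event sends agents from state 2 to state 1 and leaves states 1 and 3 unaffected; then

$$\begin{aligned} P_k(w,{{\bar{m}}}) = \begin{pmatrix} 1 & 0 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 1 \end{pmatrix} \qquad \text {and}\qquad \Phi _k(w,m,{{\bar{m}}}) = \bigl (m^1+m^2,\,0,\,m^3\bigr ), \end{aligned}$$

so \(\Phi _k\) transports the aggregate mass in accordance with the jump map J.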

Lemma 8

Let \(\mu : [0,T]\times {\mathbb {W}}^n\rightarrow {\mathbb {M}}\) and \(v: [0,T]\times {\mathbb {W}}^n\rightarrow {\mathbb {R}}^d\) be regular and non-anticipative, and suppose that \(Y=\{Y_t\}\) is an \({\mathbb {M}}\)-valued stochastic process with dynamics

$$\begin{aligned}&Y_0=m_0,\quad Y_t = Y_{T_k} + \int _{T_k}^t Y_s \cdot {\widehat{Q}}\bigl (s,W_s,\mu (s,W_s),v(s,W_s)\bigr )\mathrm {d}s\nonumber \\&\quad \text {for } t\in [T_k,T_{k+1}\rangle ,\, k\in [0:n] \end{aligned}$$
(17)

that satisfies the consistency conditions

$$\begin{aligned} Y_{T_k}=\Phi _k\bigl ( W_{T_k}, Y_{T_k-}, \mu (T_k-,W_{T_k-}) \bigr )\quad \text {for }k\in [1:n]. \end{aligned}$$

Then Y is \({\mathfrak {G}}\)-adapted.

Proof

Step 1: Existence and uniqueness of Carathéodory solutions. For each \(k\in [0:n]\) and \(w\in {\mathbb {W}}^n\), since \(\mu \) and v are regular and Q is bounded, the function

$$\begin{aligned} f:\ [T_k,T_{k+1}]\times {\mathbb {R}}^{1\times d}\rightarrow {\mathbb {R}}^{1\times d},\quad f(t,y) \triangleq y\cdot {\widehat{Q}}\bigl (t,w,\mu (t,w),v(t,w)\bigr ) \end{aligned}$$

is measurable in the first and Lipschitz continuous in the second argument. Thus, using that \(\mu \), v and \({\widehat{Q}}\) are non-anticipative, a classical result, see [36, Theorem I.5.3], implies that for each initial condition \(y\in {\mathbb {R}}^{1\times d}\) there exists a unique Carathéodory solution \(\varphi _k^{y,w_{T_k}}:\ [T_k,T_{k+1}\rangle \rightarrow {\mathbb {R}}^{1\times d}\) of

$$\begin{aligned} \dot{y}(t) = y(t)\cdot {\widehat{Q}}\bigl (t,w_{T_k},\mu (t,w_{T_k}),v(t,w_{T_k})\bigr )\text { for } t\in [T_k,T_{k+1}\rangle ,\qquad y(T_k) = y. \end{aligned}$$

Step 2: Y is \({\mathfrak {G}}\)-adapted. First note that \(Y_0=m_0\) is clearly \({\mathfrak {G}}_0\)-measurable. Next, suppose that \(Y_{T_k}\) is \({\mathfrak {G}}_{T_k}\)-measurable, and note that for \(t\in [T_k,T_{k+1}\rangle \) we have \(W_t=W_{T_k}\), so

$$\begin{aligned} Y_t = Y_{T_k}+\int _{T_k}^t Y_s\cdot {\widehat{Q}}\bigl (s,W_{T_k},\mu (s,W_{T_k}),v(s,W_{T_k}) \bigr )\mathrm {d}s. \end{aligned}$$

Thus from uniqueness in Step 1 it follows that we have the representation

$$\begin{aligned} Y_t = \varphi _k^{Y_{T_k}, W_{T_k}}(t)\quad \text {for } t\in [T_k,T_{k+1}\rangle . \end{aligned}$$

Hence \(Y_t\) is \({\mathfrak {G}}_{T_k}\)-measurable for all \(t\in [T_k,T_{k+1}\rangle \). Finally, for all \(k\in [0:(n-1)]\) the consistency condition implies that \( Y_{T_{k+1}} = \Phi _{k+1}( W_{T_{k+1}}, Y_{T_{k+1}-}, \mu (T_{k+1}-,W_{T_{k+1}-}))\) is \({\mathfrak {G}}_{T_{k+1}}\)-measurable, so the claim follows by induction on \(k\in [0:n]\). \(\square \)

Theorem 9

(Aggregation) Let \(\mu :\ [0,T]\times {\mathbb {W}}^n\rightarrow {\mathbb {M}}\) be regular and non-anticipative with \(\mu (0)=m_0\). Suppose v is a solution of (DP\({}_\mu )\) subject to (CC\({}_\mu )\), (TC\({}_\mu )\), and the agent implements his optimal strategy \({\widehat{\nu }}\) as defined in Theorem  6. Then the aggregate distribution \({\widehat{M}}\) has the \(\widehat{{\mathbb {P}}}\)-dynamics

$$\begin{aligned} \mathrm {d}{\widehat{M}}_t = {\widehat{M}}_t \cdot {\widehat{Q}}\bigl ( t, W_t, \mu (t,W_t), v(t,W_t) \bigr )\mathrm {d}t\quad \text {for }t\in [T_k,T_{k+1}\rangle ,\ k\in [0:n], \end{aligned}$$
(M)

and satisfies the initial condition

$$\begin{aligned} {\widehat{M}}_0 = m_0 \end{aligned}$$
(\(\text {M}_0\))

and the jump conditions

$$\begin{aligned} {\widehat{M}}_{T_k} = \Phi _k\bigl ( W_{T_k}, {\widehat{M}}_{T_k-}, \mu (T_k-,W_{T_k-}) \bigr )\quad \text {for }k\in [1:n]. \end{aligned}$$
(\(\text {M}_k\))

Proof

Let \(w\in {\mathbb {W}}^n\) be a common noise configuration. Since X is defined path by path, see (2) and (3), we first note that \(X=X^w\) on \(\{W_T=w\}\), where \(X^w\) satisfies (2) and

$$\begin{aligned} X^w_{T_k}=J^{X^w_{T_k-}}\bigl (T_k,w_{T_k},\mu (T_k-,w_{T_k-})\bigr )\ \text {for }k\in [1:n]. \end{aligned}$$
(18)

We define \(\zeta (w)=\{\zeta (w)_t\}\) via

$$\begin{aligned} \zeta (w)_t&\triangleq \prod \limits _{\begin{array}{c} i,j\in {\mathbb {S}},\\ i\ne j \end{array}}\biggl (\exp \Bigl \{\int _0^t\Bigl (1-Q^{ij}\bigl (s,w_s,\mu (s,w_s),h^{X^w_{s-}}(s,w_s,\mu (s,w_s),v(s,w_s))\bigr )\Bigr )\mathrm {d}s\Bigr \}\nonumber \\&\quad \times \prod \limits _{\begin{array}{c} s\in (0,t],\\ \Delta N^{ij}_s\ne 0 \end{array}}Q^{ij}\bigl (s,w_s,\mu (s,w_s),h^{X^w_{s-}}(s,w_s,\mu (s,w_s),v(s,w_s))\bigr )\biggr ). \end{aligned}$$

Using arguments analogous to those in Step 1 of the proof of Lemma 2 (see in particular (4) and (7)), it follows that there exists a probability measure \(\widehat{{\mathbb {P}}}^w\) with density process

$$\begin{aligned} \frac{\mathrm {d}\widehat{{\mathbb {P}}}^w}{\mathrm {d}{\mathbb {P}}}\bigg |_{{\mathfrak {H}}_t} \triangleq \zeta (w)_t \quad \text {for } t\in [0,T], \end{aligned}$$

where the filtration \({\mathfrak {H}}=\{{\mathfrak {H}}_t\}\) is given by

$$\begin{aligned} {\mathfrak {H}}_t\triangleq \sigma \bigl (X_0,\, N^{ij}_s\, :\, s\in [0,t];\ i,j\in {\mathbb {S}},\ i\ne j \bigr )\vee {\mathfrak {N}}\quad \text {for }t\in [0,T]. \end{aligned}$$

Furthermore, in view of (4) and (15) we have

$$\begin{aligned} \zeta (w) = {\mathcal {E}}[\theta ^{{\widehat{\nu }}}] \quad \text {on } \{W_T=w\}. \end{aligned}$$
(19)

Step 1: Conditional Kolmogorov dynamics. Throughout Step 1, we fix a common noise configuration \(w\in {\mathbb {W}}^n\). It follows exactly as in the proof of Lemma 2 (with \(\widehat{{\mathbb {P}}}^w\) in place of \(\widehat{{\mathbb {P}}}\)) that

$$\begin{aligned} \widehat{{\mathbb {P}}}^w\ll {\mathbb {P}},\qquad \widehat{{\mathbb {P}}}^w={\mathbb {P}}\quad \text {on }\sigma (X_0), \end{aligned}$$

and that for \(i,j\in {\mathbb {S}}\), \(i\ne j\), the process \(N^{ij}\) is a counting process with \(({\mathfrak {H}},\widehat{{\mathbb {P}}}^w)\)-intensity

$$\begin{aligned} Q^{ij}\bigl (t,w_t,\mu (t,w_t),h^{X^w_{t-}}(t,w_t,\mu (t,w_t),v(t,w_t))\bigr )\quad \text {for }t\in [0,T]. \end{aligned}$$

Boundedness of Q implies that for each \(z\in {\mathbb {R}}^d\) the process \(L^w[z]=\{L^w_t[z]\}\),

$$\begin{aligned} L^w_t[z] \triangleq \sum \limits _{\begin{array}{c} i,j\in {\mathbb {S}},\\ i\ne j \end{array}}\int _0^t\mathbb {1}_{\{X^w_{s-}=i\}}\bigl (z^j-z^i\bigr )\,\mathrm {d}{\widetilde{N}}^{ij,w}_s,\quad t\in [0,T], \end{aligned}$$

is an \(({\mathfrak {H}},\widehat{{\mathbb {P}}}^w)\)-martingale, where the compensated counting process \({\widetilde{N}}^{ij,w}=\{{\widetilde{N}}^{ij,w}_t\}\) is given by

$$\begin{aligned} {\widetilde{N}}^{ij,w}_t \triangleq N^{ij}_t - \int _0^t Q^{ij}\bigl (s,w_s,\mu (s,w_s),h^{X^w_{s-}}(s,w_s,\mu (s,w_s),v(s,w_s))\bigr )\mathrm {d}s,\quad t\in [0,T]. \end{aligned}$$

Using Itô’s lemma and the fact that \({\widehat{\lambda }}^{ij}_t = {\widehat{Q}}^{ij}(t,W_t,\mu (t,W_t),v(t,W_t))\) on \(\{X_{t-}=i\}\), \(t\in [0,T]\), by (15), we have for each \(z\in {\mathbb {R}}^d\), \(k\in [0:n]\) and \(t\in [T_k,T_{k+1}\rangle \)

$$\begin{aligned} z^{X^w_t} = z^{X^w_{T_k}}+L^w_t[z]-L^w_{T_k}[z]+\sum _{i=1}^d\int _{T_k}^t\mathbb {1}_{\{X_s^w=i\}}\cdot {\widehat{Q}}^{i\cdot }(s,w_s,\mu (s,w_s),v(s,w_s))\cdot z\ \mathrm {d}s. \end{aligned}$$

Taking expectations with respect to \(\widehat{{\mathbb {P}}}^w\) and using Fubini’s theorem yields

$$\begin{aligned} \widehat{{\mathbb {E}}}^w\bigl [z^{X^w_t}\bigr ]=\widehat{{\mathbb {E}}}^w\bigl [z^{X^w_{T_k}}\bigr ]+\sum _{i=1}^d\int _{T_k}^t\widehat{{\mathbb {P}}}^w(X^w_s=i)\cdot {\widehat{Q}}^{i\cdot }(s,w_s,\mu (s,w_s),v(s,w_s))\cdot z\ \mathrm {d}s, \end{aligned}$$

so with \(z=e_i\), \(i\in {\mathbb {S}}\), we get

$$\begin{aligned} \widehat{{\mathbb {P}}}^w(X_t^w=i) = \widehat{{\mathbb {P}}}^w(X^w_{T_k}=i)+\sum _{j=1}^d\int _{T_k}^t\widehat{{\mathbb {P}}}^w(X^w_s=j)\cdot {\widehat{Q}}^{ji}(s,w_s,\mu (s,w_s),v(s,w_s))\mathrm {d}s.\nonumber \\ \end{aligned}$$
(20)

It follows from (20) that \(\eta (w)=\{\eta (w)_t\}\),

$$\begin{aligned} \eta (w)_t \triangleq \ \widehat{{\mathbb {P}}}^w(X_t^w\in \,\cdot \,),\quad t\in [0,T] \end{aligned}$$
(21)

satisfies, for all \(i\in {\mathbb {S}}\) and \(k\in [0:n]\),

$$\begin{aligned} \eta (w)^i_t = \eta (w)^i_{T_k} + \int _{T_k}^t\eta (w)_s\cdot {\widehat{Q}}^{\cdot i}\left( s,w_s,\mu (s,w_s),v(s,w_s)\right) \mathrm {d}s\quad \text {for }t\in [T_k,T_{k+1}\rangle .\nonumber \\ \end{aligned}$$
(22)

Moreover, since \(\widehat{{\mathbb {P}}}^w={\mathbb {P}}\) on \(\sigma (X_0)\) and \(X_0^w=X_0\), \(\eta (w)\) satisfies the initial condition

$$\begin{aligned} \eta (w)_0 = \widehat{{\mathbb {P}}}^w(X^w_0\in \,\cdot \,)={\mathbb {P}}(X^w_0\in \,\cdot \,)={\mathbb {P}}(X_0\in \,\cdot \,)=m_0. \end{aligned}$$
(23)

Finally, consider a common noise time \(t=T_k\) and note that for all \(i\in {\mathbb {S}}\) the jump condition (18) implies

$$\begin{aligned} \eta (w)^i_{T_k}&= \widehat{{\mathbb {P}}}^w\bigl (X^w_{T_k}=i\bigr ) = \widehat{{\mathbb {P}}}^w\bigl (J^{X^w_{T_k-}}(T_k,w_{T_k},\mu (T_k-,w_{T_k-}))=i\bigr )\nonumber \\&= \sum _{j=1}^d\widehat{{\mathbb {P}}}^w\bigl (J^j(T_k,w_{T_k},\mu (T_k-,w_{T_k-}))=i\bigr |X^w_{T_k-}=j\bigr ) \cdot \widehat{{\mathbb {P}}}^w(X^w_{T_k-}=j)\nonumber \\&= \sum _{j=1}^d\mathbb {1}_{\left\{ J^j(T_k,w_{T_k},\mu (T_k-,w_{T_k-}))=i\right\} }\cdot \widehat{{\mathbb {P}}}^w(X^w_{T_k-}=j)\nonumber \\&= \sum _{j=1}^d P_k^{ji}(w_{T_k}, \mu (T_k-,w_{T_k-}))\cdot \eta (w)^j_{T_k-} = \Phi _k^i\bigl (w_{T_k},\eta (w)_{T_k-}, \mu (T_k-,w_{T_k-})\bigr ). \end{aligned}$$
(24)

Since \(\eta (W_T)= \sum _{w\in {\mathbb {W}}^n}\mathbb {1}_{\{W_T=w\}} \cdot \eta (w)\), in view of (22), (23) and (24) it follows from Lemma 8 that the process \(\eta (W_T)\) is \({\mathfrak {G}}\)-adapted.

Step 2: Identification of \(\eta (W_T)\). Recall that \({\mathfrak {G}}_T=\sigma (W_T)\vee {\mathfrak {N}}\) and let \(w\in {\mathbb {W}}^n\). For \(t\in [0,T]\) and \(i\in {\mathbb {S}}\) we have by (6) and (19)

$$\begin{aligned}&\widehat{{\mathbb {E}}}\bigl [\mathbb {1}_{\{W_T=w\}} \cdot \mathbb {1}_{\{X_t=i\}}\bigr ] = {\mathbb {E}}\bigl [\mathbb {1}_{\{W_T=w\}} \cdot \mathbb {1}_{\{X^w_t=i\}} \cdot Z^{{\widehat{\nu }}}_T\bigr ] = {\mathbb {E}}\bigl [\mathbb {1}_{\{W_T=w\}} \cdot \mathbb {1}_{\{X^w_t=i\}} \cdot \zeta (w)_T \cdot {\mathcal {E}}[\vartheta ]_T\bigr ]\\&= \prod _{k=1}^n\bigl (|{\mathbb {W}}|\cdot \kappa _k(w_k|w_1,\dots ,w_{k-1},\mu (T_k-,w_{T_k-}))\bigr ) \cdot {\mathbb {E}}\bigl [\mathbb {1}_{\{W_T=w\}} \cdot \mathbb {1}_{\{X^w_t=i\}} \cdot \zeta (w)_T\bigr ]\\&= |{\mathbb {W}}|^n \cdot \widehat{{\mathbb {P}}}(W_T=w) \cdot {\mathbb {P}}(W_T=w) \cdot \widehat{{\mathbb {P}}}^w(X_t^w=i) = \widehat{{\mathbb {E}}}\bigl [\mathbb {1}_{\{W_T=w\}} \cdot \eta (W_T)^i_t\bigr ], \end{aligned}$$

where in the final line the first identity is due to Lemma 2 and \({\mathbb {P}}\)-independence of \((\zeta (w),X^w)\) and \({\mathfrak {G}}_T\); and the second is due to (21) and the fact that \({\mathbb {P}}(W_T=w)=1/|{\mathbb {W}}|^n\). Thus

$$\begin{aligned} \widehat{{\mathbb {P}}}(X_t\in \,\cdot \,|{\mathfrak {G}}_T) = \eta (W_T)_t \quad \widehat{{\mathbb {P}}}\text {-a.s.~for } t\in [0,T]. \end{aligned}$$

Step 3: Dynamics of \({\widehat{M}}\). By Step 2 and the tower property of conditional expectation, we find that for each \(i\in {\mathbb {S}}\) and \(t\in [0,T]\)

$$\begin{aligned} {\widehat{M}}^i_t = \widehat{{\mathbb {P}}}(X_t=i|{\mathfrak {G}}_t) = \widehat{{\mathbb {E}}}\bigl [\widehat{{\mathbb {E}}}[\mathbb {1}_{\left\{ X_t=i\right\} }|{\mathfrak {G}}_T]|{\mathfrak {G}}_t\bigr ] = \widehat{{\mathbb {E}}}\bigl [\eta (W_T)^i_t|{\mathfrak {G}}_t\bigr ] = \eta (W_T)^i_t \quad \widehat{{\mathbb {P}}}\text {-a.s.}, \end{aligned}$$

where the final identity is due to the fact that \(\eta (W_T)\) is \({\mathfrak {G}}\)-adapted by Step 1 and \(\widehat{{\mathbb {E}}}\) denotes \(\widehat{{\mathbb {P}}}\)-expectation. Since both \({\widehat{M}}\) and \(\eta (W_T)\) are càdlàg, it follows that \({\widehat{M}}=\eta (W_T)\) \(\widehat{{\mathbb {P}}}\)-a.s., and (M), (\(\text {M}_0\)) and (\(\text {M}_k\)) follow from (22), (23) and (24). \(\square \)

As a by-product, the preceding proof yields the alternative representation

$$\begin{aligned} {\widehat{M}}_t = \widehat{{\mathbb {P}}}(X_t\in \,\cdot \,\,|\,{\mathfrak {G}}_T)\quad \text {for }t\in [0,T],\ \widehat{{\mathbb {P}}}\text {-a.s.} \end{aligned}$$

4.2 Mean Field Equilibrium System

As discussed above, equilibrium obtains if the agents’ ex ante beliefs coincide with the ex post outcome. This holds if and only if the ex post aggregate distribution process \({\widehat{M}}\) from (M) satisfies

$$\begin{aligned} \widehat{{\mathbb {P}}}(X_t\in \,\cdot \,|{\mathfrak {G}}_t) = {\widehat{M}}_t \overset{!}{= }M_t = \mu (t,W_t)\quad \text {for all }t\in [0,T]. \end{aligned}$$

Definition 10

(Equilibrium System). A pair \((\mu ,v)\) of regular and non-anticipative functions

$$\begin{aligned} \mu :\ [0,T]\times {\mathbb {W}}^n\rightarrow {\mathbb {M}}\qquad \text {and}\qquad v:\ [0,T]\times {\mathbb {W}}^n\rightarrow {\mathbb {R}}^d \end{aligned}$$

is called a rational expectations equilibrium, or briefly an equilibrium, if for all \(w\in {\mathbb {W}}^n\)

$$\begin{aligned} \dot{\mu }(t,w) = \mu (t,w)\cdot {\widehat{Q}}\bigl (t,w,\mu (t,w),v(t,w)\bigr ) \end{aligned}$$
(E1)
$$\begin{aligned} 0 = {\dot{v}}^i(t,w) + {\widehat{\psi }}^i\bigl (t,w,\mu (t,w),v(t,w)\bigr ) + {\widehat{Q}}^{i\cdot }\bigl (t,w,\mu (t,w),v(t,w)\bigr )\cdot v(t,w),\quad i\in {\mathbb {S}}, \end{aligned}$$
(E2)

for \(t\in [T_k,T_{k+1}\rangle \), \(k\in [0:n]\), subject to the consistency conditions

$$\begin{aligned} \mu (T_k,w) = \Phi _k\bigl (w,\mu (T_k-,w),\mu (T_k-,w)\bigr ) \end{aligned}$$
(E3)
$$\begin{aligned} v(T_k-,w) = \Psi _k\bigl (w,\mu (T_k-,w),v(T_k,\,\cdot \,)\bigr ) \end{aligned}$$
(E4)

for \(k\in [1:n]\), and the initial/terminal conditions

$$\begin{aligned} \mu (0,w) = m_0 \end{aligned}$$
(E5)
$$\begin{aligned} v(T,w) = \Psi \bigl (w,\mu (T,w)\bigr ). \end{aligned}$$
(E6)

We also refer to (E1)-(E6) as the equilibrium system. \(\square \)

In combination, Theorem 6 and Theorem 9 demonstrate that, given a solution \((\mu ,v)\) of the equilibrium system, v is the value function of the agent’s optimization problem (P\({}_\mu \)) with ex ante aggregate distribution \(\mu \); and the ex post distribution resulting from the corresponding optimal strategy is given by \(\mu \) itself. Thus we can identify a mean field equilibrium with common noise by producing a solution of the equilibrium system (E1)-(E6). We provide some illustrations in Sect. 5. Theorems 13 and 16 below ensure that this is feasible by showing that, under suitable continuity and monotonicity conditions, there exists a unique solution of the equilibrium system. The proofs are adaptations of classical arguments, based on Schauder’s fixed point theorem and monotonicity arguments, respectively.

We set

$$\begin{aligned}&Q_{\max } \triangleq \sup _{\begin{array}{c} t\in [0,T],\, w\in {\mathbb {W}}^n\\ m\in {\mathbb {M}},\, u\in {\mathbb {U}} \end{array}} \bigl \Vert Q(t,w,m,u)\bigr \Vert , \quad \psi _{\max } \triangleq \sup _{\begin{array}{c} t\in [0,T],\, w\in {\mathbb {W}}^n\\ m\in {\mathbb {M}},\, u\in {\mathbb {U}} \end{array}} \bigl \Vert \psi (t,w,m,u)\bigr \Vert , \\&\Psi _{\max } \triangleq \sup _{\begin{array}{c} m\in {\mathbb {M}}\\ w\in {\mathbb {W}}^n \end{array}}\Vert \Psi (w,m)\Vert \end{aligned}$$

and

$$\begin{aligned} v_{\max } \triangleq \bigl (\Psi _{\max } + T \cdot \psi _{\max }\bigr ) \cdot \mathrm {e}^{Q_{\max } \cdot T}. \end{aligned}$$
(25)

Note that these constants depend only on the underlying model coefficients.

Assumption 11

  (i)

    The reduced-form running reward function \({\widehat{\psi }}\) satisfies

    $$\begin{aligned} \Vert {\widehat{\psi }}(t,w,m_1,v_1)-{\widehat{\psi }}(t,w,m_2,v_2)\Vert \le L_{{\widehat{\psi }}} \cdot \bigl (\Vert m_1-m_2\Vert + \Vert v_1-v_2\Vert \bigr ) \end{aligned}$$

    for all \(t\in [0,T]\), \(w\in {\mathbb {W}}^n\), \(m_1,m_2\in {\mathbb {M}}\) and \(v_1,v_2\in {\mathbb {R}}^d\) with \(\Vert v_1\Vert ,\Vert v_2\Vert \le v_{\max }\), for some \(L_{{\widehat{\psi }}}>0\).

  (ii)

    The reduced-form intensity matrix function \({\widehat{Q}}\) satisfies

    $$\begin{aligned} \bigl \Vert {\widehat{Q}}(t,w,m_1,v_1)-{\widehat{Q}}(t,w,m_2,v_2)\bigr \Vert \le L_{{\widehat{Q}}} \cdot \bigl (\Vert m_1-m_2\Vert + \Vert v_1-v_2\Vert \bigr ) \end{aligned}$$

    for all \(t\in [0,T]\), \(w\in {\mathbb {W}}^n\), \(m_1,m_2\in {\mathbb {M}}\) and \(v_1,v_2\in {\mathbb {R}}^d\) with \(\Vert v_1\Vert ,\Vert v_2\Vert \le v_{\max }\), for some \(L_{{\widehat{Q}}}>0\).

  (iii)

    The terminal reward function \(\Psi \) is continuous with respect to m, i.e. for every \(w\in {\mathbb {W}}^n\) the map \(\Psi (w,\,\cdot \,)\) is continuous.

  (iv)

    For each \(k\in [1:n]\) and all \(i\in {\mathbb {S}}\), \(w\in {\mathbb {W}}^n\) and \(v\in {\mathbb {R}}^d\) with \(\Vert v\Vert \le v_{\max }\), the map

    $$\begin{aligned} {\mathbb {M}}\ni m \mapsto \sum _{{{\bar{w}}}_k\in {\mathbb {W}}} \kappa _k\bigl ({{\bar{w}}}_k|w_1,\ldots ,w_{k-1},m\bigr ) v^{J^i(T_k,(w_{-k},{\bar{w}}_k),m)} \in {\mathbb {R}}\quad \text {is continuous.} \end{aligned}$$
  (v)

    For each \(k\in [1:n]\) and \(w\in {\mathbb {W}}^n\) the map \(\Phi _k(w,\,\cdot \,)\) is continuous. \(\square \)

Since all norms on \({\mathbb {R}}^d\) are equivalent, the concrete specification is immaterial for Assumption 11. For the sake of convenience, in the following we use the maximum norm on \({\mathbb {R}}^d\) and a compatible matrix norm on \({\mathbb {R}}^{d\times d}\); moreover, we suppose that (ii) holds for both \({\widehat{Q}}\) and \({\widehat{Q}}^{^{{\mathsf {T}}}}\).

Remark 12

Sufficient conditions for Assumptions 11(i)-(ii) in terms of the model’s primitives can be found in, e.g., [31] or [15]. Furthermore, in the special case where the jump map J is independent of \(m\in {\mathbb {M}}\), Assumption 11(v) is trivially satisfied, and continuity of the transition kernels \(\kappa _k\) with respect to m is sufficient for Assumption 11(iv) to hold.\(\square \)

Theorem 13

(Existence of Equilibria) If Assumption 11 holds, then there exists a solution of the equilibrium system (E1)–(E6).

Proof

See Appendix A. \(\square \)

The reduced-form Hamiltonian \(\widehat{{\mathcal {H}}}:\, [0,T] \times {\mathbb {W}}^n\times {\mathbb {M}}\times {\mathbb {R}}^d\rightarrow {\mathbb {R}}^d\) is defined via

$$\begin{aligned} \widehat{{\mathcal {H}}}^i(t,w,m,v)&\triangleq \sup _{u\in {\mathbb {U}}}\,\bigl \{ \psi ^i(t,w,m,u)+Q^{i\cdot }(t,w,m,u)\cdot v \bigr \} \\&={\widehat{\psi }}^i(t,w,m,v)+{\widehat{Q}}^{i\cdot }(t,w,m,v)\cdot v. \end{aligned}$$

Assumption 14

Let Assumptions 11(i) and (ii) hold, and suppose that:

  (i)

    The terminal payoff function \(\Psi \) is monotone with respect to \(m\in {\mathbb {M}}\), i.e.

    $$\begin{aligned} (m_1 - m_2) \cdot \bigl [ \Psi (w,m_1) - \Psi (w,m_2) \bigr ] \le 0 \quad \text {for all } w\in {\mathbb {W}}^n,\ m_1,m_2\in {\mathbb {M}}. \end{aligned}$$
  (ii)

    The reduced-form Hamiltonian \(\widehat{{\mathcal {H}}}\) is convex with respect to v, i.e. for all \(i\in {\mathbb {S}}\), \(t\in [0,T]\), \(w\in {\mathbb {W}}^n\), \(m\in {\mathbb {M}}\) and \(v_1,v_2\in {\mathbb {R}}^d\) satisfying \(\Vert v_1\Vert , \Vert v_2\Vert \le v_{\max }\) we have

    $$\begin{aligned} \widehat{{\mathcal {H}}}^i(t,w,m,v_2) - \widehat{{\mathcal {H}}}^i(t,w,m,v_1) - {\widehat{Q}}^{i\cdot }(t,w,m,v_1) \cdot (v_2-v_1) \ge 0. \end{aligned}$$
  (iii)

    The reduced-form Hamiltonian \(\widehat{{\mathcal {H}}}\) satisfies a uniform monotonicity condition with respect to \(m\in {\mathbb {M}}\), i.e. there exist \(\alpha ,\gamma >0\) such that

    $$\begin{aligned}&m_1 \cdot \bigl [ \widehat{{\mathcal {H}}}(t,w,m_2,v_2) - \widehat{{\mathcal {H}}}(t,w,m_1,v_2) \bigr ] + m_2 \cdot \bigl [ \widehat{{\mathcal {H}}}(t,w,m_1,v_1) - \widehat{{\mathcal {H}}}(t,w,m_2,v_1) \bigr ] \\&\quad \ge \gamma \cdot \Vert m_1-m_2\Vert ^\alpha \end{aligned}$$

    for all \(t\in [0,T]\), \(w\in {\mathbb {W}}^n\), \(m_1,m_2\in {\mathbb {M}}\) and \(v_1,v_2\in {\mathbb {R}}^d\) with \(\Vert v_1\Vert , \Vert v_2\Vert \le v_{\max }\).

  (iv)

    For \(k\in [1:n]\) the maps \(\kappa _k\) and J satisfy the following monotonicity conditions in \(m\in {\mathbb {M}}\): For all \(w\in {\mathbb {W}}^n\), \(m_1,m_2\in {\mathbb {M}}\) and \(v_1,v_2\in {\mathbb {R}}^d\) satisfying \(\Vert v_1\Vert ,\Vert v_2\Vert \le v_{\max }\) as well as

    $$\begin{aligned} \bigl [ \Phi _k(w,m_1) - \Phi _k(w,m_2) \bigr ] \cdot (v_1-v_2) \le 0 \end{aligned}$$

    we have

    $$\begin{aligned}&\bigl [ \kappa _k(w_k|w_1,\ldots ,w_{k-1},m_1) - \kappa _k(w_k|w_1,\ldots ,w_{k-1},m_2) \bigr ] \nonumber \\&\quad \cdot \bigl ( m_2\cdot v_1^{J^{\cdot }(T_k,w,m_2)} - m_1\cdot v_1^{J^{\cdot }(T_k,w,m_1)} \nonumber \\&\quad + m_2\cdot v_2^{J^{\cdot }(T_k,w,m_2)} - m_1\cdot v_2^{J^{\cdot }(T_k,w,m_1)} \bigr ) \ge 0 \end{aligned}$$
    (26)

    and

    $$\begin{aligned}&\kappa _k(w_k|w_1,\ldots ,w_{k-1},m_2) \cdot m_1 \cdot \bigl ( v_2^{J^{\cdot }(T_k,w,m_2)} - v_2^{J^{\cdot }(T_k,w,m_1)} \bigr ) \\&\quad + \kappa _k(w_k|w_1,\ldots ,w_{k-1},m_1) \cdot m_2 \cdot \bigl ( v_1^{J^{\cdot }(T_k,w,m_1)} - v_1^{J^{\cdot }(T_k,w,m_2)} \bigr ) \ge 0. \end{aligned}$$
    (27)

The constant \(v_{\max }>0\) in 14(ii)-(iv) is defined in (25). Conditions 14(i)-(iii) are standard in the literature; see, e.g., Assumptions 1-3 in [31].
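
As a simple illustration of the monotonicity condition 14(i) (a hypothetical specification, not one used in the examples below), consider a terminal payoff that penalizes crowding in the agent’s own state, \(\Psi ^i(w,m)\triangleq -\theta \, m^i\) for some \(\theta \ge 0\); then

$$\begin{aligned} (m_1-m_2)\cdot \bigl [ \Psi (w,m_1) - \Psi (w,m_2) \bigr ] = -\theta \sum _{i\in {\mathbb {S}}} \bigl (m_1^i-m_2^i\bigr )^2 \le 0 \quad \text {for all } w\in {\mathbb {W}}^n,\ m_1,m_2\in {\mathbb {M}}. \end{aligned}$$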

Remark 15

Assumption 14 simplifies if some model coefficients do not depend on the mean field parameter \(m\in {\mathbb {M}}\):

  1. (a)

    If \({\widehat{Q}}\) is independent of m, 14(iii) reduces to a monotonicity condition for \({\widehat{\psi }}\).

  2. (b)

    In 14(iv), (26) is trivially satisfied if the probability weights \(\kappa _k\) do not depend on m.

  3. (c)

    In 14(iv), (27) is trivially satisfied if the jump map J is independent of m.\(\square \)

Theorem 16

(Uniqueness of Equilibria) Under the monotonicity conditions stated in Assumption 14, the equilibrium system (E1)–(E6) possesses at most one solution.

Proof

See Appendix B. \(\square \)

5 Applications

Before we illustrate our results in two showcase examples, we briefly discuss our numerical approach to the equilibrium system (E1)-(E6). Equations (E1)-(E2) constitute a forward-backward system of ODEs in 2d dimensions with boundary conditions (E3)-(E6), parametrized by the common noise configurations \(w\in {\mathbb {W}}^n\) and coupled across them. The special case \(n=0\) (no common noise) corresponds to the setting of [31] and [15], where the equilibrium system reduces to a single 2d-dimensional forward-backward ODE. For \(n\ge 1\), the consistency conditions (E3)-(E4) specify initial conditions for \(\mu \) on \([T_k,T_{k+1}\rangle \) and terminal conditions for v on \([T_{k-1},T_k\rangle \), \(k\in [1:n]\). Since these conditions are interconnected, there is in general no segment \([T_k,T_{k+1}\rangle \times {\mathbb {W}}^n\) on which the equilibrium system provides both an explicit initial condition for \(\mu \) and an explicit terminal condition for v, so the problem cannot simply be split into subintervals. Rather, the equilibrium system can be regarded as a multi-point boundary value problem: for each of the \(|{\mathbb {W}}|^k\) possible combinations of common noise factors on \([T_k,T_{k+1}\rangle \), \(k\in [0:n]\), we have to solve a coupled forward-backward system of ODEs in 2d dimensions, resulting in a tree of such systems of size

$$\begin{aligned} \sum _{k=0}^n|{\mathbb {W}}|^k=\frac{|{\mathbb {W}}|^{n+1}-1}{|{\mathbb {W}}|-1}\in {\mathcal {O}}\bigl (|{\mathbb {W}}|^n\bigr ). \end{aligned}$$
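
As a quick sanity check on this count, the following snippet enumerates all common noise histories segment by segment for the hypothetical values \(|{\mathbb {W}}|=3\) and \(n=4\):

```python
from itertools import product

def tree_size(card_W, n):
    # one forward-backward subsystem per history (w_1, ..., w_k)
    # on each segment [T_k, T_{k+1}), k = 0, ..., n
    return sum(card_W**k for k in range(n + 1))

W = ("up", "down", "storm")          # hypothetical common noise alphabet
histories = [h for k in range(5) for h in product(W, repeat=k)]
assert len(histories) == tree_size(len(W), 4) == (3**5 - 1) // (3 - 1)  # = 121
```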

Our approach to solving (E1)-(E6) numerically relies on its probabilistic interpretation as a fixed-point system, based on Theorem 13. Starting from an initial flow of probability weights \(\mu _0(t,w)\), \((t,w)\in [0,T]\times {\mathbb {W}}^n\), with \(\mu _0(0,w)=m_0\) for all \(w\in {\mathbb {W}}^n\), we solve (DP\({}_\mu \)) subject to (TC\({}_\mu \)) and (CC\({}_\mu \)) backward in time for all non-negligible common noise configurations \(w\in {\mathbb {W}}^n\) to obtain the value \(v_0(t,w)\), \((t,w)\in [0,T]\times {\mathbb {W}}^n\), of the agents’ optimal response to the given belief \(\mu _0\). This, in turn, is used to solve (M) subject to (\(\text {M}_0\)) and (\(\text {M}_k\)) forward in time, which yields an ex post aggregate distribution \(\mu _1(t,w)\), \((t,w)\in [0,T]\times {\mathbb {W}}^n\). We then iterate this procedure with \(\mu _1\) in place of \(\mu _0\), and so on.
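
The iteration just described can be summarized by the following schematic loop; `solve_value` and `solve_distribution` stand for the backward and forward solvers sketched above, and the damping parameter is an optional stabilization (`damping=1.0` recovers the plain iteration). This is an illustrative sketch, not the exact implementation used for the figures below.

```python
import numpy as np

def solve_equilibrium(solve_value, solve_distribution, mu0,
                      damping=1.0, tol=1e-8, max_iter=500):
    """Fixed-point (Picard) iteration for the equilibrium system.

    solve_value(mu)       -> v : backward pass (dynamic programming) given belief mu
    solve_distribution(v) -> mu: forward pass (aggregate dynamics) given response v
    mu0                   : initial flow of probability weights, e.g. an array
                            indexed by (time step, common noise configuration, state)
    """
    mu = np.asarray(mu0, dtype=float)
    for _ in range(max_iter):
        v = solve_value(mu)                    # agents' best response to the belief
        mu_new = solve_distribution(v)         # ex post aggregate distribution
        gap = np.max(np.abs(mu_new - mu))      # consistency gap in sup-norm
        mu = (1.0 - damping) * mu + damping * mu_new
        if gap < tol:                          # stop once belief and outcome agree
            break
    return mu, v
```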

5.1 A Decentralized Agricultural Production Model

As a first (stylized) example we consider a mean field game of agents, each of whom owns (an infinitesimal amount of) land of identical size and quality within a given area. If farmed, each field has a productivity \(f(w_k)>0\) that depends on the common weather condition \(w_k\). We assume that the weather is either good, bad or catastrophic, so \(w_k\in {\mathbb {W}}\triangleq \{\uparrow ,\downarrow ,\lightning \}\), and that it changes at the given common noise times \(T_1,\ldots ,T_n\).

Each agent is in exactly one state \(i\in {\mathbb {S}}\triangleq \{0,1\}\), depending on whether he grows crops on his field (\(i=1\), the agent is a farmer) or not (\(i=0\)). The selling price p of his harvest depends on aggregate production, and thus in particular on the proportion \(m^1\in [0,1]\) of farmers; the mean field interaction is transmitted through the market price of the crop. We assume that p is a strictly decreasing function of overall production \(f(w_k)\cdot m^1\); see Fig. 1 for an illustration.

Fig. 1 Price function p (parameters as in Table 1)

We assume that \(f(\uparrow )\ge f(\downarrow )=f(\lightning )\ge 0\). Moreover, on the catastrophic event \(\{W_k=\lightning \}\) all agents are reduced to being non-farmers, and thus

$$\begin{aligned} J^i(t,w,m) \triangleq {\left\{ \begin{array}{ll} 0 &{} \text {if } t\in \{T_1,\ldots ,T_n\},\, t=T_k,\, w_k=\lightning ,\\ i &{} \text {else} \end{array}\right. } \end{aligned}$$

for \((i,t,w,m)\in {\mathbb {S}}\times [0,T]\times {\mathbb {W}}^n\times {\mathbb {M}}\). Each agent can make an effort \(u\in {\mathbb {U}}\triangleq [0,\infty )\) to become a farmer; the intensity matrix for state transitions is given by

$$\begin{aligned} Q(t,w,m,u) = \left[ \begin{array}{cc} -u \cdot q_{\mathrm {entry}} &{} u \cdot q_{\mathrm {entry}} \\ q_{\mathrm {exit}} &{} -q_{\mathrm {exit}} \end{array} \right] \quad \text {for } (t,w,m,u)\in [0,T]\times {\mathbb {W}}^n\times {\mathbb {M}}\times {\mathbb {U}}, \end{aligned}$$

where \(q_{\mathrm {entry}},q_{\mathrm {exit}}\ge 0\) are given transition rates. The running rewards capture the fact that both the effort to build up farming capacity and production itself are costly, while revenues from selling the crop generate profits; thus

$$\begin{aligned} \psi ^0(t,w,m,u) = -\frac{1}{2}c_{\mathrm {entry}} \cdot u^2\quad \text {and}\quad \psi ^1(t,w,m,u) = p\bigl (f(w_k) \cdot m^1\bigr ) \cdot f(w_k) - c_{\mathrm {prod}} \end{aligned}$$

for \(t\in [T_k,T_{k+1}\rangle \), \(k\in [0:n]\), where \(w_0\triangleq \ \uparrow \) and \(c_{\mathrm {entry}},c_{\mathrm {prod}}\ge 0\). The terminal reward is zero. It follows that the maximizer \(h^0\) in Assumption 3 is unique and given by

$$\begin{aligned} h^0(t,w,m,v) = \frac{q_{\mathrm {entry}}}{c_{\mathrm {entry}}}(v^1-v^0)^+; \end{aligned}$$

a specification of \(h^1\) is immaterial. We choose \(m^1_0\triangleq 10\%\) for the initial proportion of farmers, and report the relevant coefficients in Table 1.
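
For completeness, here is the short first-order argument behind the formula for \(h^0\) above: for state \(i=0\), the map to be maximized in the Hamiltonian is

$$\begin{aligned} u\mapsto \psi ^0(t,w,m,u)+Q^{0\cdot }(t,w,m,u)\cdot v = -\tfrac{1}{2}c_{\mathrm {entry}}\cdot u^2 + u\cdot q_{\mathrm {entry}}\cdot (v^1-v^0), \end{aligned}$$

which is concave in u with unconstrained maximizer \(u^*=\frac{q_{\mathrm {entry}}}{c_{\mathrm {entry}}}(v^1-v^0)\) (assuming \(c_{\mathrm {entry}}>0\)); projecting onto \({\mathbb {U}}=[0,\infty )\) yields the positive part \((v^1-v^0)^+\).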

Table 1 Coefficients in the agricultural production model

Our results for the evolution of the mean field equilibrium are shown in Figs. 2 and 3 for various common noise configurations \(w\in {\mathbb {W}}^n\) and the following two baseline models:

\((\mathrm {nC})\):

Catastrophic weather conditions do not occur; we use

$$\begin{aligned} \kappa _k(\uparrow |w_1,\ldots ,w_{k-1},m) = \kappa _k(\downarrow |w_1,\ldots ,w_{k-1},m) = 0.5 \end{aligned}$$

for all \(w\in {\mathbb {W}}^n\) and \(m\in {\mathbb {M}}\).

\((\mathrm {C})\):

Catastrophic events are likely; we use

$$\begin{aligned} \kappa _k(\uparrow |w_1,\ldots ,w_{k-1},m)&= 0.25,\qquad \kappa _k(\downarrow |w_1,\ldots ,w_{k-1},m) = 0.25, \\ \kappa _k(\, \lightning \, |w_1,\ldots ,w_{k-1},m)&= 0.5 \end{aligned}$$

for all \(w\in {\mathbb {W}}^n\) and \(m\in {\mathbb {M}}\).

Fig. 2 Proportion of farmers in model \((\mathrm {C})\) for all possible common noise configurations \(w\in {\mathbb {W}}^n\)

Fig. 3 Proportion of farmers in models \((\mathrm {nC})\) and \((\mathrm {C})\)

The model specified above satisfies both Assumption 11 and Assumption 14, so Theorems 13 and 16 guarantee the existence of a unique mean field equilibrium characterized by (E1)-(E6). Figure 2 illustrates the tree of all possible equilibrium evolutions in model \((\mathrm {C})\). Figures 3, 4 and 5 illustrate the resulting equilibrium proportions of farmers, optimal actions, and market prices for selected common noise configurations. To illustrate the effect of uncertainty about future weather conditions, we also show, for each common noise configuration, the theoretical perfect-foresight equilibria that would pertain if future weather conditions were known; these are plotted as dashed lines in Figs. 3, 4 and 5, and the subscript \(\circ \) indicates the relevant deterministic common noise path. Equilibrium prices are stochastically modulated by the prevailing weather conditions, both directly and indirectly: First, prices jump at common noise times due to weather-related changes in productivity. Second, weather conditions affect market prices indirectly through their effect on the proportion of farming agents. Thus, with consistently good weather conditions, agents are strongly incentivized to become farmers, see Fig. 4; the fraction of farmers increases, see Fig. 3; and the resulting increase in production drives down prices, see Fig. 5. By contrast, under bad weather conditions, incentives are weaker and prices remain higher. Both effects are dampened if a catastrophic event may occur. In addition, efforts tend to decrease between common noise times owing to the uncertainty about future weather conditions; this effect is more pronounced in the presence of possible catastrophic events.

Fig. 4 Optimal action \(h^0\) of non-farmers in models \((\mathrm {nC})\) and \((\mathrm {C})\)

Fig. 5 Equilibrium market prices in models \((\mathrm {nC})\) and \((\mathrm {C})\)

5.2 An \(\mathrm {SIR}\) Model with Random One-Shot Vaccination

Our second application is a mean field game of agents that are confronted with the spread of an infectious disease. Our main focus is to illustrate the qualitative effects of common noise on the equilibrium behavior of the system. We consider a classical \(\mathrm {SIR}\) model setup with \({\mathbb {S}}=\{\mathrm {S},\mathrm {I},\mathrm {R}\}\): Each agent can be either susceptible to infection (\(\mathrm {S}\)), infected and simultaneously infectious to other agents (\(\mathrm {I}\)), or recovered and thus immune to (re-)infection (\(\mathrm {R}\)); see Fig. 6.

Fig. 6 State space and transitions in the \(\mathrm {SIR}\) model

The infection rate is proportional to the prevalence of the disease, i.e. the percentage of currently infected agents. Susceptible agents can make individual protection efforts of size \(u\in {\mathbb {U}}\triangleq [0,1]\) to reduce their infection intensity. The transition intensities are given by

$$\begin{aligned} Q(t,w,m,u) \triangleq \left[ \begin{array}{ccc} -q_{\mathrm {inf}}(t,w,m,u) &{} q_{\mathrm {inf}}(t,w,m,u) &{} 0\\ 0 &{} -q_{\mathrm {I}\mathrm {R}} &{} q_{\mathrm {I}\mathrm {R}} \\ 0 &{} 0 &{} 0 \end{array} \right] \end{aligned}$$

for \((t,w,m,u)\in [0,T]\times {\mathbb {W}}^n\times {\mathbb {M}}\times {\mathbb {U}}\), where \(q_{\mathrm {I}\mathrm {R}}\ge 0\) denotes the recovery rate of infected agents and the infection rate is given by

$$\begin{aligned} q_{\mathrm {inf}}(t,w,m,u) \triangleq q_{\mathrm {S}\mathrm {I}}\cdot m^{\mathrm {I}}\cdot (1-u)\cdot \mathbb {1}_{\{t<\tau ^{\star }\}}(w) \end{aligned}$$

with a given maximum rate \(q_{\mathrm {S}\mathrm {I}}\ge 0\). The running reward penalizes both protection efforts and time spent in the infected state; with \(c_{\mathrm {P}},\psi _{\mathrm {I}}\ge 0\) we set

$$\begin{aligned} \psi ^{\mathrm {S}}(t,w,m,u) \triangleq -c_{\mathrm {P}} \frac{u}{1-u},\qquad \psi ^{\mathrm {I}}(t,w,m,u) \triangleq -\psi _{\mathrm {I}},\qquad \psi ^{\mathrm {R}}(t,w,m,u) \triangleq 0. \end{aligned}$$

In addition, we include the possibility of a one-shot vaccination that becomes available, simultaneously to all agents, at a random time \(\tau ^{\star }\in \{T_1,\ldots ,T_n\}\subset (0,T)\). We set \({\mathbb {W}}\triangleq \{0,1\}\) and identify the \(k^{\text {th}}\) unit vector \(e_k=(\delta _{kj})_{j\in [1:n]}\in {\mathbb {W}}^n\), \(k\in [1:n]\), with the indicator of the event \(\{\tau ^{\star }=T_k\}\). The event that no vaccine becomes available by time T is represented by \(0\in {\mathbb {W}}^n\); in this case we set \(\tau ^{\star }\triangleq +\infty \). If and when the vaccine becomes available, all susceptible agents are vaccinated instantaneously, rendering them immune to infection; thus

$$\begin{aligned} J^{\mathrm {S}}(t,w,m) \triangleq {\left\{ \begin{array}{ll} \mathrm {R}&{} \text {if } t\in \{T_1,\ldots ,T_n\},\, t = T_k = \tau ^{\star }, \\ \mathrm {S}&{} \text {otherwise} \end{array}\right. } \quad \text {and} \quad J^i(t,w,m) \triangleq i \quad \text {for } i\in \{\mathrm {I},\mathrm {R}\}. \end{aligned}$$

The probability of vaccination becoming available is proportional to the percentage of agents that have already recovered from the disease. Thus for \(k\in [1:n]\), \(w_1,\ldots ,w_k\in {\mathbb {W}}\) and \(m\in {\mathbb {M}}\) we set

$$\begin{aligned} \kappa _k(1\, |w_1,\ldots ,w_{k-1},m) \triangleq {\left\{ \begin{array}{ll} \alpha \cdot m^{\mathrm {R}} &{} \text {if } w_1,\ldots ,w_{k-1}=0,\\ 0 &{} \text {otherwise,} \end{array}\right. } \end{aligned}$$

and \(\kappa _k(0\, |w_1,\ldots ,w_{k-1},m) \triangleq 1-\kappa _k(1\, |w_1,\ldots ,w_{k-1},m)\), where \(\alpha \in (0,1]\). As a consequence, for all \((i,t,w,m,v)\in {\mathbb {S}}\times [0,T]\times {\mathbb {W}}^n\times {\mathbb {M}}\times {\mathbb {R}}^3\), a maximizer as required in Assumption 3 is given by

$$\begin{aligned} h^{\mathrm {S}}(t,w,m,v) \triangleq {\left\{ \begin{array}{ll} \Bigl [1-\sqrt{\frac{c_{\mathrm {P}}}{q_{\mathrm {S}\mathrm {I}}\cdot m^{\mathrm {I}}\cdot (v^{\mathrm {S}}-v^{\mathrm {I}})}}\Bigr ]^+ &{} \text {if } v^{\mathrm {S}}>v^{\mathrm {I}},\ m^{\mathrm {I}}>0 \text { and }{t<\tau ^\star },\\ \qquad \qquad 0 &{} \text {otherwise,} \end{array}\right. } \end{aligned}$$

and \(h^i(t,w,m,v)\triangleq 0\) for \(i\in \{\mathrm {I},\mathrm {R}\}\).
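
The expression for \(h^{\mathrm {S}}\) follows from a first-order condition: for \(t<\tau ^{\star }\), \(m^{\mathrm {I}}>0\) and \(v^{\mathrm {S}}>v^{\mathrm {I}}\), the map to be maximized is

$$\begin{aligned} u\mapsto \psi ^{\mathrm {S}}(t,w,m,u)+Q^{\mathrm {S}\cdot }(t,w,m,u)\cdot v = -c_{\mathrm {P}}\,\frac{u}{1-u} - q_{\mathrm {S}\mathrm {I}}\cdot m^{\mathrm {I}}\cdot (1-u)\cdot \bigl (v^{\mathrm {S}}-v^{\mathrm {I}}\bigr ), \end{aligned}$$

which is concave on \([0,1)\) with derivative \(-c_{\mathrm {P}}/(1-u)^2 + q_{\mathrm {S}\mathrm {I}}\cdot m^{\mathrm {I}}\cdot (v^{\mathrm {S}}-v^{\mathrm {I}})\); setting the derivative to zero and projecting onto \({\mathbb {U}}=[0,1]\) yields the expression above. In all other cases the map is non-increasing in u, so zero effort is optimal.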

Remark 17

(\(\mathrm {SIR}\) Models in the Literature). Note that, given the above specification of the transition matrix Q, the forward dynamics (E1) within the equilibrium system (E1)-(E6) read as follows:

$$\begin{aligned} \left\{ \begin{aligned} {\dot{\mu }}^{\mathrm {S}}(t,w)&= -q_{\mathrm {SI}}\cdot \mu ^{\mathrm {I}}(t,w)\cdot \bigl (1-h^{\mathrm {S}}(t,w,\mu (t,w),v(t,w))\bigr )\cdot \mathbb {1}_{\{t<\tau ^{\star }\}}(w)\cdot \mu ^{\mathrm {S}}(t,w)\\ {\dot{\mu }}^{\mathrm {I}}(t,w)&= q_{\mathrm {SI}}\cdot \mu ^{\mathrm {I}}(t,w)\cdot \bigl (1-h^{\mathrm {S}}(t,w,\mu (t,w),v(t,w))\bigr )\cdot \mathbb {1}_{\{t<\tau ^{\star }\}}(w)\cdot \mu ^{\mathrm {S}}(t,w)- q_{\mathrm {IR}}\cdot \mu ^{\mathrm {I}}(t,w)\\ {\dot{\mu }}^{\mathrm {R}}(t,w)&= \ q_{\mathrm {IR}}\cdot \mu ^{\mathrm {I}}(t,w). \end{aligned} \right. \end{aligned}$$

Disregarding common noise, these constitute a controlled variant of the classical \(\mathrm {SIR}\) dynamics, which are a basic building block of numerous compartmental epidemic models in the literature; see, among others, [32, 37, 38, 47] and the references therein. The \(\mathrm {SIR}\) mean field game with controlled infection rates, albeit without common noise, has recently been studied in the independent article [26]; we also refer to [46] and [23] for mean field models with controlled vaccination rates. Mathematically similar contagion mechanisms also appear in, e.g., [40, 41], §7.2.3 in [9], §7.1.10 in [10], or §4.4 in [52].\(\square \)
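
To make the remark concrete, the following sketch integrates these dynamics by forward Euler for a single common noise configuration without vaccination (so the indicator equals one throughout), with the protection effort passed in as a function. The rates are hypothetical and merely chosen so that \(q_{\mathrm {S}\mathrm {I}}/q_{\mathrm {I}\mathrm {R}}=15\) as in the text; they are not the values of Table 2, and the horizon T = 20 is likewise an assumption.

```python
import numpy as np

def simulate_sir(mu0, effort, q_SI, q_IR, T=20.0, dt=0.01):
    """Forward Euler for the controlled SIR dynamics of Remark 17
    (single common noise configuration, no vaccination jump)."""
    steps = int(round(T / dt))
    mu = np.zeros((steps + 1, 3))              # columns: (S, I, R)
    mu[0] = mu0
    for k in range(steps):
        t = k * dt
        mu_S, mu_I = mu[k, 0], mu[k, 1]
        flow_SI = q_SI * mu_I * (1.0 - effort(t, mu[k])) * mu_S   # new infections
        flow_IR = q_IR * mu_I                                     # recoveries
        mu[k + 1] = mu[k] + dt * np.array([-flow_SI, flow_SI - flow_IR, flow_IR])
    return mu

# Hypothetical illustration: constant protection effort of 30 %
trajectory = simulate_sir(mu0=(0.995, 0.005, 0.0),
                          effort=lambda t, m: 0.3,
                          q_SI=1.5, q_IR=0.1)
```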

While Theorem 13 guarantees the existence of a mean field equilibrium for (a variant of) the \(\mathrm {SIR}\) model, the monotonicity conditions of Theorem 16 do not hold in this setup. Nevertheless, our numerical results reliably yield consistent equilibria. For our illustrations, the initial distribution of agents is given by \(m_0\triangleq (0.995,0.005,0.00)\), and the model coefficients are reported in Table 2. Note that there are \(n=1999\) common noise times \(T_k=k\cdot 0.01\), \(k=1,\ldots ,1999\), at which a vaccine can be administered. The specifications of \(q_{\mathrm {S}\mathrm {I}}\) and \(q_{\mathrm {I}\mathrm {R}}\) imply a basic reproduction number \(R_0\triangleq q_{\mathrm {S}\mathrm {I}}/q_{\mathrm {I}\mathrm {R}}=15\) in the absence of vaccination and protection efforts.

Table 2 Coefficients in the \(\mathrm {SIR}\) model

Our results for the mean field equilibrium distributions of agents \(\mu \) and the corresponding optimal protection efforts of susceptible agents \(h^{\mathrm {S}}\) are displayed in Figs. 7, 8, and 9 for different common noise configurations, i.e. vaccination times \(\tau ^{\star }\). As in Sect. 5.1, we also display the corresponding (theoretical) perfect-foresight equilibria, marked by the subscript \(\circ \).

Fig. 7 Equilibrium distribution and protection effort for \(\tau ^{\star }=+\infty \): Mean field game with common noise (top) and corresponding perfect-foresight equilibrium (bottom)

Fig. 8 Equilibrium distribution and protection effort for \(\tau ^{\star }=2.5\): Mean field game with common noise (top) and corresponding perfect-foresight equilibrium (bottom)

Fig. 9 Equilibrium distribution and protection effort for \(\tau ^{\star }=5\): Mean field game with common noise (top) and corresponding perfect-foresight equilibrium (bottom)

Note that an agent’s running reward is the same in state \(\mathrm {S}\) with zero protection effort as in state \(\mathrm {R}\); relative to these, agents are penalized in state \(\mathrm {I}\) and hence aim to avoid that state. Susceptible agents can reach the immune state \(\mathrm {R}\) in two ways: First, they can become infected and overcome the disease; second, they can be vaccinated and jump instantly from state \(\mathrm {S}\) to state \(\mathrm {R}\). While the first alternative is painful, the second comes at no cost and is hence clearly preferable. However, as the availability of a vaccine cannot be controlled by the agents, all they can do is protect themselves against infection, at a running cost, until the vaccine becomes available.

Figures 7, 8, and 9 demonstrate that the possibility of vaccination as a common noise event can dampen the spread of the disease and lower the peak infection rate. This is due to an increase in agents’ protection efforts during the time period when the proportion of infected agents is high. By contrast, in the perfect-foresight equilibria where the vaccination date is known, agents do not make substantial protection efforts until the vaccination date is imminent, see Figs. 8 and 9; in the scenario without vaccination, see Fig. 7, protection efforts are only ever made by a very small fraction of the population. With perfect foresight, the agents’ main rationale is to avoid being in state \(\mathrm {I}\) when the vaccine becomes available. This highlights the importance of being able to model the vaccination date as a (random) common noise event. Finally, observe that our numerical results indicate convergence to the stationary distribution \({{\bar{\mu }}}=(0,0,1)\in {\mathbb {M}}\), showing that the model is able to capture the entire evolution of an epidemic.