A Note on the Differentiability of the Hellinger–Kantorovich Distances

Fleißner, Florentine Catharina

doi:10.1007/s00245-022-09948-y

A Note on the Differentiability of the Hellinger–Kantorovich Distances

Open access
Published: 13 March 2023

Volume 87, article number 37, (2023)
Cite this article

Download PDF

You have full access to this open access article

Applied Mathematics & Optimization Aims and scope Submit manuscript

A Note on the Differentiability of the Hellinger–Kantorovich Distances

Download PDF

Florentine Catharina Fleißner ORCID: orcid.org/0000-0002-3999-7886¹

1007 Accesses
Explore all metrics

Abstract

This paper will deal with differentiability properties of the class of Hellinger–Kantorovich distances $\textsf{HK}_{\Lambda , \Sigma } \ (\Lambda , \Sigma > 0)$ which was recently introduced on the space ${\mathcal {M}}({\mathbb {R}}^d)$ of finite nonnegative Radon measures. The derivatives of $t\mapsto \textsf{HK}_{\Lambda , \Sigma }(\mu _t, \nu _t)^2,$ for absolutely continuous curves $(\mu _t)_t, (\nu _t)_t$ in $({\mathcal {M}}({\mathbb {R}}^d),\textsf{HK}_{\Lambda , \Sigma })$, will be computed ${\mathscr {L}}^1$-a.e.. The characterization of absolutely continuous curves in $({\mathcal {M}}({\mathbb {R}}^d), \textsf{HK}_{\Lambda , \Sigma })$ will be refined.

Variability regions for the second derivative of bounded analytic functions

Article 05 January 2022

On stepanov Type differentiability Theorems

Article 16 November 2014

The continuity equation on metric measure spaces

Article 16 July 2014

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Recently, a new class of distances on the space ${\mathcal {M}}({\mathbb {R}}^d)$ of finite nonnegative Radon measures was established by three independent teams [3, 4, 7,8,9]. We will follow the presentation of these distances by Liero, Mielke and Savaré [8, 9] who named it Hellinger–Kantorovich distances. The class of Hellinger–Kantorovich distances $\textsf{HK}_{\Lambda , \Sigma } \ (\Lambda , \Sigma > 0)$ is based on the conversion of one measure into another one (possibly having different total mass) by means of transport and creation / annihilation of mass. The parameters $\Lambda $ and $\Sigma $ serve as weightings of the transport part and the mass creation/annihilation part respectively. To be more precise, the square $\textsf{HK}_{\Lambda , \Sigma }(\mu _1, \mu _2)^2$ of the Hellinger–Kantorovich distance $\textsf{HK}_{\Lambda , \Sigma }$ between two measures $\mu _1, \mu _2 \in {\mathcal {M}}({\mathbb {R}}^d)$ on ${\mathbb {R}}^d$ corresponds to

$$\begin{aligned}{} & {} \min \Big \{\sum _{i=1}^{2}{\frac{4}{\Sigma }\int _{{\mathbb {R}}^d}{(\sigma _i\log \sigma _i - \sigma _i + 1)\,{\textrm{d}}\mu _i}} \nonumber \\{} & {} \quad + \int _{{\mathbb {R}}^d\times {\mathbb {R}}^d}{{\textsf {c}}_{\Lambda , \Sigma }(|x_1-x_2|)\,{\textrm{d}}\gamma }: \ \gamma \in {\mathcal {M}}({\mathbb {R}}^d\times {\mathbb {R}}^d), \ \gamma _i\ll \mu _i \Big \}, \end{aligned}$$

(1.1)

with entropy cost functions $\frac{4}{\Sigma }(\sigma _i\log \sigma _i - \sigma _i + 1)$,

$$\begin{aligned} \sigma _i := \frac{\,{\textrm{d}}\gamma _i}{\,{\textrm{d}}\mu _i} \quad (\gamma _i \text { i-th marginal of } \gamma ), \end{aligned}$$

(1.2)

and transportation cost function

$$\begin{aligned} {\textsf {c}}_{\Lambda , \Sigma }({\textsf {d}}):= {\left\{ \begin{array}{ll} -\frac{8}{\Sigma }\log (\cos (\sqrt{\Sigma /(4\Lambda )} {\textsf {d}})) &{}\text { if } {\textsf {d}} < \pi \sqrt{\Lambda /\Sigma },\\ +\infty &{}\text { if } {\textsf {d}} \ge \pi \sqrt{\Lambda /\Sigma }. \end{array}\right. } \end{aligned}$$

(1.3)

There exists an optimal plan $\gamma $ for the Logarithmic Entropy-Transport problem (1.1) (cf. Thm. 3.3 in [9]), and if $\mu _1$ is absolutely continuous with respect to the Lebesgue measure and $\gamma $ is such optimal plan, then there exists a Borel optimal transport mapping $t: {\mathbb {R}}^d\rightarrow {\mathbb {R}}^d$ so that $\gamma $ takes the form

$$\begin{aligned} \gamma \ = \ (I\times t)_{\#} \gamma _{1} \ = \ (I\times t)_{\#}\sigma _{1}\mu _1 \end{aligned}$$

(cf. Thm. 4.5 in [6] and Thm. 6.6 in [9]). We refer the reader to ([9], Cor. 7.14, Thms. 7.17 and 7.20) for the proofs that $\textsf{HK}_{\Lambda , \Sigma }$ defined via the Logarithmic Entropy-Transport problem (1.1) indeed represents a distance on the space of finite nonnegative Radon measures and that $({\mathcal {M}}({\mathbb {R}}^d), \textsf{HK}_{\Lambda , \Sigma })$ is a complete metric space. Furthermore, the Hellinger–Kantorovich distance $\textsf{HK}_{\Lambda , \Sigma }$ metrizes the weak topology on ${\mathcal {M}}({\mathbb {R}}^d)$ in duality with continuous and bounded functions (cf. Thm. 7.15 in [9]) and can be interpreted as weighted infimal convolution of the Kantorovich-Wasserstein distance and the Hellinger-Kakutani distance. A representation formula à la Benamou-Brenier which can be proved for $\textsf{HK}_{\Lambda , \Sigma }$ (cf. ([9], Thm. 8.18; [8], Thm. 3.6(v))) justifies this interpretation:

$$\begin{aligned} \textsf{HK}_{\Lambda , \Sigma }(\mu _1, \mu _2)^2 = \min \Big \{\int _0^1{\int _{{\mathbb {R}}^d}{(\Lambda |v_t|^2 + \Sigma |w_t|^2)\,{\textrm{d}}\mu _t}\,{\textrm{d}}t}: \ \mu _1 {\mathop {\rightsquigarrow }\limits ^{(\mu , v,w)}}\mu _2 \Big \}\nonumber \\ \end{aligned}$$

(1.4)

where $\mu _1{\mathop {\rightsquigarrow }\limits ^{(\mu , v, w)}}\mu _2$ means that $\mu : [0, 1]\rightarrow {\mathcal {M}}({\mathbb {R}}^d)$ is a continuous curve connecting $\mu (0)=\mu _1$ and $\mu (1)=\mu _2$ and satisfying the continuity equation with reaction

$$\begin{aligned} \partial _t\mu _t = -\Lambda \textrm{div}(v_t\mu _t) + \Sigma w_t\mu _t, \end{aligned}$$

(1.5)

governed by Borel functions $v: (0,1)\times {\mathbb {R}}^d\rightarrow {\mathbb {R}}^d$ and $w:(0,1)\times {\mathbb {R}}^d\rightarrow \mathbb {R}$ with

$$\begin{aligned} \int _0^1{\int _{{\mathbb {R}}^d}{(\Lambda |v_t|^2 + \Sigma |w_t|^2)\,{\textrm{d}}\mu _t}\,{\textrm{d}}t} < +\infty , \end{aligned}$$

(1.6)

in duality with $\textrm{C}^{\infty }$-functions with compact support in $(0.1)\times {\mathbb {R}}^d$, i.e.

$$\begin{aligned}{} & {} \int _0^{1}\int _{{\mathbb {R}}^d}(\partial _t\psi (t,x) + \Lambda \langle \nabla \psi (t,x), v(t,x)\rangle \nonumber \\{} & {} \quad + \Sigma \psi (t,x)w(t,x))\,{\textrm{d}}\mu _t(x)\,{\textrm{d}}t \ = \ 0 \end{aligned}$$

(1.7)

for all $\psi \in \textrm{C}^{\infty }_c((0,1)\times {\mathbb {R}}^d)$.

The class of such continuous curves $\mu $ satisfying (1.5), (1.6) for some Borel vector field (v, w) coincides with the class of absolutely continuous curves $(\mu _t)_{t\in [0,1]}$ in $({\mathcal {M}}({\mathbb {R}}^d), \textsf{HK}_{\Lambda , \Sigma })$ with square-integrable metric derivatives (cf. Thms. 8.16 and 8.17 in [9], see Sect. 3 in this paper).

In order to deepen our understanding of a distance, it is always worth studying its differentiability along absolutely continuous curves (e.g. see Chap. 8 in [1] for the corresponding analysis of the Kantorovich-Wasserstein distance on the space of Borel probability measures with finite second order moments). The present paper addresses this issue for the class of Hellinger–Kantorovich distances on the space of finite nonnegative Radon measures. Clearly, if $(\mu _t)_{t\in [0,1]}, (\nu _t)_{t\in [0,1]}$ are absolutely continuous curves in $({\mathcal {M}}({\mathbb {R}}^d), \textsf{HK}_{\Lambda , \Sigma })$, then the mapping

$$\begin{aligned} t\mapsto \textsf{HK}_{\Lambda , \Sigma }(\mu _t, \nu _t)^2 \end{aligned}$$

(1.8)

is absolutely continuous and therefore ${\mathscr {L}}^1$-a.e. differentiable. A natural question that arises is the one of the concrete form of the corresponding derivatives. We will answer this question for absolutely continuous curves with square-integrable metric derivatives (for which such characterization (1.5) is available), refine that characterization by providing more information on (v, w) (see Prop. 3.1), establish a linearization result (see Thm. 3.4), and determine

$$\begin{aligned} \frac{\,{\textrm{d}}}{\,{\textrm{d}}t} \textsf{HK}_{\Lambda , \Sigma }(\mu _t, \nu _t)^2 \end{aligned}$$

(1.9)

at ${\mathscr {L}}^1$-a.e. $t\in [0,1]$ (see Thm. 4.1). This piece of work can be viewed as continuation of Sect. 3 in the author’s paper [5] constituting a starting point for the study of differentiability properties of the Hellinger–Kantorovich distances. Therein, we identified elements of the Fréchet subdifferential of mappings

$$\begin{aligned} t\mapsto -\textsf{HK}_{\Lambda , \Sigma }((I+tv)_{\#}(1+tR)^2\mu _0, \nu )^2 \end{aligned}$$

at $t=0$, for $\mu _0, \nu \in {\mathcal {M}}({\mathbb {R}}^d)$ and bounded Borel functions $v: {\mathbb {R}}^d\rightarrow {\mathbb {R}}^d$ and $R: {\mathbb {R}}^d\rightarrow \mathbb {R}$. That subdifferential calculus was an essential ingredient for our Minimizing Movement approach to a class of scalar reaction-diffusion equations [5] substantiating their gradient-flow-like structure in the space of finite nonnegative Radon measures endowed with the Hellinger–Kantorovich distance $\textsf{HK}_{\Lambda , \Sigma }$.

The proof in [9] that absolutely continuous curves in $({\mathcal {M}}({\mathbb {H}}), \textsf{HK}_{\Lambda , \Sigma })$ with square-integrable metric derivatives are characterized via (1.5), (1.6) was carried out only for $\mathbb {H}={\mathbb {R}}^d$, endowed with usual scalar product $\langle \cdot , \cdot \rangle $ and norm $|\cdot |:=\sqrt{\langle \cdot ,\cdot \rangle }$, but according to a comment at the beginning of Sect. 8.5 in [9], it should be possible to prove such characterization result in a more general setting. We would like to remark that also our computation of the derivatives (1.9) may be adapted for general separable Hilbert spaces ${\mathbb {H}}$.

Our plan for the paper is to give an equivalent characterization of the Hellinger–Kantorovich distances in Sect. 2, to state and prove new results on absolutely continuous curves in Sect. 3 and to perform the computation of the derivatives (1.9) in Sect. 4.

2 Optimal Transportation on the Cone

According to ([8], Sect. 4) and ([9], Sect. 7), the Logarithmic Entropy-Transport problem (1.1) translates into a problem of optimal transportation on the geometric cone ${\mathfrak {C}}$ on ${\mathbb {R}}^d$, see (2.16), (2.17) below. The fact that all the information on transport of mass and creation / annihilation of mass according to (1.1) lies in a pure transportation problem has proved extremely useful for the analysis of $\textsf{HK}_{\Lambda , \Sigma }$ in [9] and for our subdifferential calculus in [5].

Geometric cone $({\mathfrak {C}}, {\textsf {d}}_{{\mathfrak {C}}, \Lambda , \Sigma })$. The geometric cone is defined as the quotient space

$$\begin{aligned} {\mathfrak {C}}:= {\mathbb {R}}^d\times [0, +\infty )/\sim \end{aligned}$$

(2.1)

with

$$\begin{aligned} (x_1, r_1) \sim (x_2, r_2) \quad \Leftrightarrow \quad r_1 = r_2 = 0 \text { or } r_1=r_2, \ x_1 = x_2 \end{aligned}$$

(2.2)

and is endowed with a class of distances ${\textsf {d}}_{{\mathfrak {C}}, \Lambda , \Sigma } \ (\Lambda , \Sigma > 0)$. The vertex ${\mathfrak {o}}$ (for $r=0$) and [x, r] (for $x\in {\mathbb {R}}^d$ and $r>0$) denote the corresponding equivalence classes and

$$\begin{aligned}{} & {} {\textsf {d}}_{{\mathfrak {C}}, \Lambda , \Sigma }([x_1,r_1],[x_2, r_2])^2\nonumber \\{} & {} \quad := \frac{4}{\Sigma }\Big (r_1^2 + r_2^2 - 2 r_1 r_2 \cos \Big (\Big (\sqrt{\Sigma /4\Lambda }\ |x_1 - x_2|\Big )\wedge \pi \Big )\Big ) \end{aligned}$$

(2.3)

(where ${\mathfrak {o}}$ is identified with $[{\bar{x}}, 0]$ for some ${\bar{x}}\in {\mathbb {R}}^d$). It can be proved that

$$\begin{aligned} {\textsf {d}}_{{\mathfrak {C}}, \Lambda , \Sigma }(y_0, y_1)^2 = \min \Big \{\int _0^1{\Big (\frac{4}{\Sigma }({\dot{r}}(s))^2 + \frac{1}{\Lambda }r(s)^2|{\dot{x}}(s)|^2\Big )\,{\textrm{d}}s} \Big | \ y_0 {\mathop {\rightsquigarrow }\limits ^{[x,r]}} y_1 \Big \}\nonumber \\ \end{aligned}$$

(2.4)

for $y_i=[x_i,r_i]\in {\mathfrak {C}}$, where $y_0 {\mathop {\rightsquigarrow }\limits ^{[x,r]}} y_1$ means that $x\in {\textrm{C}}^{1}([0,1]; {\mathbb {R}}^d), r\in {\textrm{C}}^{1}([0,1];[0,+\infty ))$ and $ [x(i), r(i)]=y_i$, so that the cone distance may be interpreted as dissipation distance generated by the metric tensor

$$\begin{aligned} {\mathfrak {G}}^{\Lambda , \Sigma }_{[x,r]}((\dot{x_1},\dot{r_1}), (\dot{x_2}, \dot{r_2})) := \frac{4}{\Sigma } \dot{r_1}\dot{r_2} + \frac{1}{\Lambda }r^2 \langle \dot{x_1}, \dot{x_2} \rangle \end{aligned}$$

(2.5)

(cf. Sect. 8.1 in [9]). This metric tensor (2.5) will appear in the formulas in our differential calculus of $\textsf{HK}_{\Lambda , \Sigma }$.

We show how to construct geodesics in $({\mathfrak {C}}, {\textsf {d}}_{{\mathfrak {C}}, \Lambda , \Sigma })$ (cf. Sect. 8.1 in [9]) as they will play an important role in our analysis of (1.9), too. Let $y_i:=[x_i,r_i]\in {\mathfrak {C}}, i=1,2,$ and suppose that $|x_1 - x_2| \le \pi \sqrt{\Lambda /\Sigma }, \ r_1, r_2 > 0$. We search for functions ${\mathcal {R}}_{y_1,y_2}: [0, 1]\rightarrow [0, +\infty )$ and $\theta _{y_1,y_2}: [0, 1]\rightarrow [0, 1]$ so that the curve $\eta : [0,1]\rightarrow {\mathfrak {C}}$ defined as $\eta (s):=[x_1 + \theta _{y_1,y_2}(s)(x_2-x_1), {\mathcal {R}}_{y_1,y_2}(s)]$ is a (constant speed) geodesic connecting $[x_1, r_1]$ and $[x_2, r_2]$, which means ${\textsf {d}}_{{\mathfrak {C}}, \Lambda , \Sigma }(\eta (s), \eta (t)) = |s-t| {\textsf {d}}_{{\mathfrak {C}}, \Lambda , \Sigma }([x_1,r_1], [x_2,r_2])$ for all $s, t\in [0,1]$. If $x_1 = x_2$, we set $\theta _{y_1,y_2}\equiv 0$. We note that

$$\begin{aligned} {\textsf {d}}_{{\mathfrak {C}}, \Lambda , \Sigma }(\eta (s), \eta (t))^2 \ = \ |z(s) - z(t)|^2_{{\mathbb {C}}}, \end{aligned}$$

(2.6)

where $z: [0, 1]\rightarrow {\mathbb {C}}$ is the curve in the complex plane ${\mathbb {C}}$ defined as

$$\begin{aligned} z(s):=\frac{2}{\sqrt{\Sigma }}{\mathcal {R}}_{y_1,y_2}(s)\exp \Big (i\theta _{y_1,y_2}(s)\sqrt{\Sigma /4\Lambda } \ |x_1 - x_2|\Big ), \end{aligned}$$

(2.7)

and $|\cdot |_{\mathbb {C}}$ denotes the absolute value for complex numbers. Thus, if z is a geodesic in the complex plane between $z_1:=\frac{2}{\sqrt{\Sigma }} r_1$ and $z_2:=\frac{2}{\sqrt{\Sigma }}r_2\exp \Big (i\sqrt{\Sigma /4\Lambda } \ |x_1 - x_2|\Big )$, i.e.

$$\begin{aligned} z(s) = z_1 + s(z_2-z_1) \quad \text { for all } s\in [0,1], \end{aligned}$$

(2.8)

then $\eta $ is a geodesic in $({\mathfrak {C}}, {\textsf {d}}_{{\mathfrak {C}}, \Lambda , \Sigma })$ between $[x_1, r_1]$ and $[x_2, r_2]$. This condition yields an appropriate choice for ${\mathcal {R}}_{y_1,y_2}: [0, 1] \rightarrow [0, +\infty )$ and $\theta _{y_1,y_2}: [0,1]\rightarrow [0,1]$, and it is not difficult to see that they are both smooth functions, their first derivatives satisfy

$$\begin{aligned}{} & {} \frac{4}{\Sigma }({\mathcal {R}}_{y_1,y_2}'(s))^2 + \frac{1}{\Lambda }{\mathcal {R}}_{y_1,y_2}(s)^2(\theta _{y_1,y_2}'(s))^2|x_1 - x_2|^2 \ \nonumber \\{} & {} \quad = \ {\textsf {d}}_{{\mathfrak {C}}, \Lambda , \Sigma }([x_1, r_1], [x_2, r_2])^2 \quad \text { for all } s\in (0,1), \end{aligned}$$

(2.9)

and they are right differentiable at $s=0$ with right derivatives

$$\begin{aligned}{} & {} \theta _{y_1,y_2,+}'(0)=\frac{r_2}{r_1}\frac{\sin (\sqrt{\Sigma /4\Lambda } \ {\textsf {d}}(x_1,x_2))}{\sqrt{\Sigma /4\Lambda } \ {\textsf {d}}(x_1,x_2)} \ \text { , } \nonumber \\ {\mathcal {R}}_{y_1,y_2,+}'(0){} & {} \quad =r_2\cos \Big (\sqrt{\Sigma /4\Lambda } \ {\textsf {d}}(x_1,x_2)\Big )-r_1. \end{aligned}$$

(2.10)

It is noteworthy that

$$\begin{aligned} {\mathfrak {t}}_{y_1,y_2}(s):=\Big (\theta _{y_1,y_2}'(s)(x_2-x_1), {\mathcal {R}}_{y_1,y_2}'(s)\Big ) \end{aligned}$$

(2.11)

represents the tangent vector to the geodesic at $\eta (s), s\in (0,1)$, with

$$\begin{aligned} {\mathfrak {t}}_{y_1,y_2}(0):= \lim _{s\downarrow 0}{\mathfrak {t}}_{y_1,y_2}(s)=\Big (\theta _{y_1,y_2,+}'(0)(x_2-x_1), {\mathcal {R}}_{y_1,y_2,+}'(0)\Big ), \end{aligned}$$

(2.12)

and the left-hand side of (2.9) equals the metric tensor ${\mathfrak {G}}^{\Lambda , \Sigma }_{\eta (s)}({\mathfrak {t}}_{y_1,y_2}(s), {\mathfrak {t}}_{y_1,y_2}(s))$ (cf. (2.5)).

We obtain a geodesic from $[x_1, r_1]$ to the vertex ${\mathfrak {o}}$ by setting $\theta _{y_1,{\mathfrak {o}}}\equiv 0$ and ${\mathcal {R}}_{y_1,{\mathfrak {o}}}(s):=(1-s)r_1$ and identifying ${\mathfrak {o}}$ with $[x_1, 0]$. Also in this case, (2.9) and the second part of (2.10) hold good.

Optimal transportation problem. The distance ${\textsf {d}}_{{\mathfrak {C}}, \Lambda , \Sigma }$ gives rise to an optimal transport problem on the cone and therefore to an extended quadratic Kantorovich-Wasserstein distance ${\mathcal {W}}_{{\mathfrak {C}}, \Lambda , \Sigma }$ on the space ${\mathcal {M}}_2({\mathfrak {C}})$ of finite nonnegative Radon measures on ${\mathfrak {C}}$ with finite second order moments, i.e. $\int _{{\mathfrak {C}}}{{\textsf {d}}_{{\mathfrak {C}}, \Lambda , \Sigma }([x,r],{\mathfrak {o}})^2\,{\textrm{d}}\alpha ([x,r])} < +\infty $. The extended Kantorovich-Wasserstein distance ${\mathcal {W}}_{{\mathfrak {C}}, \Lambda , \Sigma }(\alpha _1, \alpha _2)$ between two measures $\alpha _1, \alpha _2\in {\mathcal {M}}_2({\mathfrak {C}})$ is equal to $+\infty $ if $\alpha _1({\mathfrak {C}})\ne \alpha _2({\mathfrak {C}})$ and is given by

$$\begin{aligned}{} & {} {\mathcal {W}}_{{\mathfrak {C}}, \Lambda , \Sigma }(\alpha _1, \alpha _2)^2 \nonumber \\{} & {} \quad := \min \Big \{\int _{{\mathfrak {C}}\times {\mathfrak {C}}}{{\textsf {d}}_{{\mathfrak {C}}, \Lambda , \Sigma }([x_1,r_1],[x_2,r_2])^2\,{\textrm{d}}\beta } \ | \ \beta \in \Gamma (\alpha _1, \alpha _2)\Big \} \end{aligned}$$

(2.13)

if $\alpha _1({\mathfrak {C}}) = \alpha _2({\mathfrak {C}})$, with $\Gamma (\alpha _1, \alpha _2)$ being the set of finite nonnegative Radon measures on ${\mathfrak {C}}\times {\mathfrak {C}}$ whose first and second marginals coincide with $\alpha _1$ and $\alpha _2$. Every measure $\alpha \in {\mathcal {M}}_2({\mathfrak {C}})$ on the cone is assigned a measure ${\mathfrak {h}}\alpha \in {\mathcal {M}}({\mathbb {R}}^d)$ on ${\mathbb {R}}^d$,

$$\begin{aligned} {\mathfrak {h}}\alpha := {\textsf {x}}_{\#}({\textsf {r}}^2\alpha ), \end{aligned}$$

(2.14)

with $({\textsf {x}}, {\textsf {r}}): {\mathfrak {C}}\rightarrow {\mathbb {R}}^d\times [0, +\infty )$ defined as

$$\begin{aligned} ({\textsf {x}}, {\textsf {r}})([x, r]) := (x,r) \text { for } [x,r]\in {\mathfrak {C}}, \ r > 0, \ ({\textsf {x}}, {\textsf {r}})({\mathfrak {o}}):=({\bar{x}},0), \end{aligned}$$

(2.15)

which means $\int _{{\mathbb {R}}^d}{\phi (x)\,{\textrm{d}}({\mathfrak {h}}\alpha )} = \int _{{\mathfrak {C}}}{{\textsf {r}}^2\phi ({\textsf {x}})\,{\textrm{d}}\alpha }$ for all continuous and bounded functions $\phi : {\mathbb {R}}^d\rightarrow \mathbb {R}$ (short $\phi \in \textrm{C}^{0}_b({\mathbb {R}}^d)$). Please note that the mapping ${\mathfrak {h}}: {\mathcal {M}}_2({\mathfrak {C}}) \rightarrow {\mathcal {M}}({\mathbb {R}}^d)$ is not injective.

Now, an equivalent characterization of the Hellinger–Kantorovich distance $\textsf{HK}_{\Lambda , \Sigma }$ is given by the transportation problems

$$\begin{aligned}{} & {} \textsf{HK}_{\Lambda , \Sigma }(\mu _1, \mu _2)^2 \ = \ \min \Big \{{\mathcal {W}}_{{\mathfrak {C}}, \Lambda , \Sigma }(\alpha _1, \alpha _2)^2 \ \Big | \ \alpha _i\in {\mathcal {M}}_2({\mathfrak {C}}), \ {\mathfrak {h}}\alpha _i = \mu _i \Big \} \end{aligned}$$

(2.16)

$$\begin{aligned}{} & {} = \ \min \Big \{{\mathcal {W}}_{{\mathfrak {C}}, \Lambda , \Sigma }(\alpha _1, \alpha _2)^2 + \frac{4}{\Sigma }\sum _{i=1}^{2}{(\mu _i - {\mathfrak {h}}\alpha _i)({\mathbb {R}}^d)} \Big | \ \alpha _i\in {\mathcal {M}}_2({\mathfrak {C}}),\nonumber \\{} & {} \qquad {\mathfrak {h}}\alpha _i \le \mu _i \Big \}, \end{aligned}$$

(2.17)

cf. Probl. 7.4, Thm. 7.6, Lem. 7.9, Thm. 7.20 in [9]. Every solution $\gamma \in {\mathcal {M}}({\mathbb {R}}^d\times {\mathbb {R}}^d)$ to the Logarithmic Entropy-Transport problem (1.1) induces a solution $\beta \in {\mathcal {M}}({\mathfrak {C}}\times {\mathfrak {C}})$ to ((2.17), (2.13)): if $\gamma $ is an optimal plan for (1.1) with Lebesgue decompositions^{Footnote 1}

$$\begin{aligned} \mu _i = \rho _i\gamma _i + \mu _i^\bot , \end{aligned}$$

(2.18)

then

$$\begin{aligned} \beta := ([x_1, \sqrt{\rho _1(x_1)}], [x_2, \sqrt{\rho _2(x_2)}])_{\#}\gamma \ \in {\mathcal {M}}({\mathfrak {C}}\times {\mathfrak {C}}) \end{aligned}$$

(2.19)

is an optimal plan for the transport problem (2.17), (2.13) (cf. ([9], Thm. 7.20(iii))). Furthermore, if $\beta \in {\mathcal {M}}({\mathfrak {C}}\times {\mathfrak {C}})$ is a solution to (2.17), (2.13) or a solution to (2.16), (2.13) (which exists by ( [9], Thm. 7.6)), then

$$\begin{aligned} \beta \Big (\Big \{([x_1, r_1], [x_2,r_2])\in {\mathfrak {C}}\times {\mathfrak {C}}: \ r_1, r_2> 0, \ |x_1 - x_2| > \pi \sqrt{\Lambda /\Sigma }\Big \}\Big )= 0,\nonumber \\ \end{aligned}$$

(2.20)

(cf. ([9], Lem. 7.19)).

3 Absolutely Continuous Curves

We fix $\Lambda , \Sigma > 0$ and examine the behaviour of absolutely continuous curves in $({\mathcal {M}}({\mathbb {R}}^d), \textsf{HK}_{\Lambda , \Sigma })$.

Let $(\mu _t)_{t\in [0,1]}$ be an absolutely continuous curve in $({\mathcal {M}}({\mathbb {R}}^d), \textsf{HK}_{\Lambda , \Sigma })$ with square-integrable metric derivative, i.e. the limit

$$\begin{aligned} |\mu _t'| := \mathop {\lim }_{h\rightarrow 0} \frac{\textsf{HK}_{\Lambda , \Sigma }(\mu _{t+h}, \mu _t)}{|h|} \end{aligned}$$

(3.1)

exists for ${\mathscr {L}}^1$-a.e. $t\in (0,1)$, the function $t \mapsto |\mu _t'|$ which is called metric derivative of $(\mu _t)_t$ belongs to ${\textrm{L}}^{2}((0,1))$ and

$$\begin{aligned} \textsf{HK}_{\Lambda , \Sigma }(\mu _s,\mu _t) \le \int ^{t}_{s}{|\mu _r'| \,{\textrm{d}}r} \quad \quad \text { for all } 0\le s \le t \le 1 \end{aligned}$$

(3.2)

(cf. Def. 1.1.1 and Thm. 1.1.2 in [1]). According to Thms. 8.16 and 8.17 in [9], there exists an essentially unique Borel vector field $(v,w): (0,1)\times {\mathbb {R}}^d \rightarrow {\mathbb {R}}^d\times \mathbb {R}$ so that the continuity equation with reaction

$$\begin{aligned} \partial _t\mu _t = -\Lambda \textrm{div}(v_t\mu _t) + \Sigma w_t\mu _t \end{aligned}$$

(3.3)

($v_t:= v(t, \cdot ), \ w_t:= w(t, \cdot )$) holds good, in duality with $\textrm{C}^{\infty }$-functions with compact support in $(0,1)\times {\mathbb {R}}^d$ (see (1.7)), and

$$\begin{aligned} \int _{{\mathbb {R}}^d}{(\Lambda |v_t|^2 + \Sigma |w_t|^2)\,{\textrm{d}}\mu _t} \ = \ |\mu _t'|^2 \quad \text { for } {\mathscr {L}}^1\text {-a.e. } t\in (0,1). \end{aligned}$$

(3.4)

For every $t\in (0,1)$ and $h\in (-t, 1-t)$, there exists a plan $\beta _{t, t+h}\in {\mathcal {M}}({\mathfrak {C}}\times {\mathfrak {C}})$ which is optimal in the definition of $\textsf{HK}_{\Lambda , \Sigma }(\mu _t, \mu _{t+h})^2$ according to (2.16), (2.13), i.e.

$$\begin{aligned}{} & {} \textsf{HK}_{\Lambda , \Sigma }(\mu _t, \mu _{t+h})^2 = \int _{{\mathfrak {C}}\times {\mathfrak {C}}}{{\textsf {d}}_{{\mathfrak {C}}, \Lambda , \Sigma }([x_1,r_1],[x_2,r_2])^2\,{\textrm{d}}\beta _{t,t+h}}, \\{} & {} \qquad {\mathfrak {h}}(\pi ^1_{\#}\beta _{t,t+h}) = \mu _t, \ {\mathfrak {h}}(\pi ^2_{\#}\beta _{t,t+h}) = \mu _{t+h}, \end{aligned}$$

and whose first marginal $\pi ^1_{\#}\beta _{t, t+h}$ satisfies

$$\begin{aligned} \int _{{\mathfrak {C}}}{\phi ([x,r])\,{\textrm{d}}(\pi ^1_{\#}\beta _{t, t+h})} = \int _{{\mathbb {R}}^d}{\phi ([x, 1])\,{\textrm{d}}\mu _t} + h^2\phi ({\mathfrak {o}}) \end{aligned}$$

(3.5)

for all $\phi \in \textrm{C}^{0}_b({\mathfrak {C}})$ (cf. Thm. 7.6 and Lem. 7.10 in [9]).

This notation holds good throughout the rest of the paper.

As a first result of our analysis of absolutely continuous curves, Prop. 3.1 will identify $(v_t, w_t)$ as belonging to a particular class of functions.

Proposition 3.1

For ${\mathscr {L}}^1$-a.e. $t\in (0,1)$, the Borel function $(v_t, w_t)$ belongs to the closure in ${\textrm{L}}^{2}(\mu _t, {\mathbb {R}}^d\times \mathbb {R})$ of the subspace $\{(\nabla \zeta , \zeta ): \ \zeta \in \textrm{C}^{\infty }_c({\mathbb {R}}^d)\}$.

Here $({\textrm{L}}^{2}(\mu _t, {\mathbb {R}}^d\times {\mathbb {R}}), ||\cdot ||_{{\textrm{L}}^{2}(\mu _t, {\mathbb {R}}^d\times {\mathbb {R}})})$ denotes the normed space of all $\mu _t$-measurable functions $({\bar{v}}, {\bar{w}})$ from ${\mathbb {R}}^d$ to ${\mathbb {R}}^d\times {\mathbb {R}}$ satisfying

$$\begin{aligned} ||({\bar{v}},{\bar{w}})||_{{\textrm{L}}^{2}(\mu _t, {\mathbb {R}}^d\times {\mathbb {R}})} := \Big (\int _{{\mathbb {R}}^d}{(\Lambda |{\bar{v}}|^2 + \Sigma |{\bar{w}}|^2)\,{\textrm{d}}\mu _t}\Big )^{1/2} < +\infty . \end{aligned}$$

(3.6)

Proof

We construct a Borel vector field $({\tilde{v}}, {\tilde{w}}): (0,1)\times {\mathbb {R}}^d\rightarrow {\mathbb {R}}^d\times {\mathbb {R}}$ satisfying (3.3) so that, for ${\mathscr {L}}^1$-a.e. $t\in (0,1)$, the function $({\tilde{v}}_t,{\tilde{w}}_t)$ belongs to the closure in ${\textrm{L}}^{2}(\mu _t, {\mathbb {R}}^d\times \mathbb {R})$ of the subspace $\{(\nabla \zeta , \zeta ): \ \zeta \in \textrm{C}^{\infty }_c({\mathbb {R}}^d)\}$ and

$$\begin{aligned} ||({\tilde{v}}_t,{\tilde{w}}_t)||_{{\textrm{L}}^{2}(\mu _t, {\mathbb {R}}^d\times {\mathbb {R}})}^2 = \int _{{\mathbb {R}}^d}{(\Lambda |{\tilde{v}}_t|^2 + \Sigma |{\tilde{w}}_t|^2)\,{\textrm{d}}\mu _t} \ \le \ |\mu _t'|^2. \end{aligned}$$

(3.7)

We begin the proof with some estimations. Let $\phi \in \textrm{C}^{\infty }_c({\mathbb {R}}^d)$. It follows from the construction of ${\mathcal {R}}_{[x_1,r_1],[x_2,r_2]}$ and $\theta _{[x_1,r_1],[x_2,r_2]}$ according to (2.6)-(2.9) that

$$\begin{aligned}{} & {} \frac{2}{\Sigma }\frac{\,{\textrm{d}}^2}{\,{\textrm{d}}s^2}{\mathcal {R}}_{[x_1,r_1],[x_2,r_2]}(s)^2 = {\textsf {d}}_{{\mathfrak {C}}, \Lambda , \Sigma }([x_1,r_1],[x_2,r_2])^2, \\{} & {} \quad \Big |\theta _{[x_1,r_1],[x_2,r_2]}''(s){\mathcal {R}}_{[x_1,r_1],[x_2,r_2]}(s)^2(x_2-x_1)\Big | \le C_{\Sigma , \Lambda } {\textsf {d}}_{{\mathfrak {C}},\Lambda , \Sigma }([x_1,r_1],[x_2,r_2])^2, \\{} & {} \quad \Big |2\theta _{[x_1,r_1],[x_2,r_2]}'(s){\mathcal {R}}_{[x_1,r_1],[x_2,r_2]}(s){\mathcal {R}}'_{[x_1,r_1],[x_2,r_2]}(s)(x_2-x_1)\Big | \\{} & {} \quad \le C_{\Sigma , \Lambda }{\textsf {d}}_{{\mathfrak {C}},\Lambda , \Sigma }([x_1,r_1],[x_2,r_2])^2, \\{} & {} \quad \Big |\frac{\,{\textrm{d}}^2}{\,{\textrm{d}}s^2}\Big [\phi (x_1+\theta _{[x_1,r_1],[x_2,r_2]}(s)(x_2-x_1)){\mathcal {R}}_{[x_1,r_1],[x_2,r_2]}(s)^2\Big ]\Big |\\{} & {} \quad \le C_\phi C_{\Sigma ,\Lambda }{\textsf {d}}_{{\mathfrak {C}}, \Lambda , \Sigma }([x_1,r_1],[x_2,r_2])^2, \end{aligned}$$

for $s\in (0,1)$, with $C_\phi > 0$ only depending on $\phi $ and $C_{\Sigma , \Lambda }:=2\Sigma + 4\Lambda $; we refer the reader to the proof of Prop. 2.5 in [5] for details. With (2.9) and these estimations on hand, it is straightforward to prove that there exists a constant $C_{\phi ,\Lambda , \Sigma } > 0$ only depending on $\phi , \ \Lambda $ and $\Sigma $ so that

$$\begin{aligned}{} & {} |\varphi '_{y_1,y_2}({\bar{s}}) - \varphi '_{y_1,y_2}(s)| \le C_{\phi ,\Lambda , \Sigma } \ {\textsf {d}}_{{\mathfrak {C}},\Lambda ,\Sigma }(y_1,y_2)^2, \end{aligned}$$

(3.8)

$$\begin{aligned}{} & {} \Big |\varphi '_{y_1,y_2}(s) - \langle \nabla \phi (x_1), \theta '_{y_1, y_2}(s)(x_2-x_1)\rangle {\mathcal {R}}_{y_1,y_2}(s)^2 + 2\phi (x_1){\mathcal {R}}'_{y_1,y_2}(s){\mathcal {R}}_{y_1,y_2}(s)\Big |\nonumber \\{} & {} \quad \le C_{\phi ,\Lambda , \Sigma } \ {\textsf {d}}_{{\mathfrak {C}},\Lambda ,\Sigma }(y_1,y_2)^2 \end{aligned}$$

(3.9)

and

$$\begin{aligned}{} & {} \Big |\Big (\langle \nabla \phi (x_1), \theta '_{y_1, y_2}(s)(x_2-x_1)\rangle {\mathcal {R}}_{y_1,y_2}(s) + 2\phi (x_1){\mathcal {R}}'_{y_1,y_2}(s)\Big )\Big ({\mathcal {R}}_{y_1,y_2}(s)-r_1\Big )\Big |\nonumber \\{} & {} \quad \le C_{\phi ,\Lambda , \Sigma } \ {\textsf {d}}_{{\mathfrak {C}},\Lambda ,\Sigma }(y_1,y_2)^2 \end{aligned}$$

(3.10)

for all $s, {\bar{s}} \in (0,1)$, with $y_i:=[x_i,r_i], \ \varphi _{y_1,y_2}(s):=\phi (x_1+\theta _{[x_1,r_1],[x_2,r_2]}(s)(x_2-x_1)){\mathcal {R}}_{[x_1,r_1],[x_2,r_2]}(s)^2$.

Now, let $t\in (0,1)$ so that the limit (3.1) exists and ${\mathfrak {C}}_{\mathfrak {o}} := {\mathfrak {C}}\setminus \{{\mathfrak {o}}\}$. By applying (2.20), (3.9), (3.10), (3.5), Hölder’s inequality and (2.9), we obtain

$$\begin{aligned}&\Big |\int _{{\mathbb {R}}^d}{\phi \,{\textrm{d}}\mu _{t+h}} - \int _{{\mathbb {R}}^d}{\phi \,{\textrm{d}}\mu _t}\Big | = \Big |\int _{{\mathfrak {C}}\times {\mathfrak {C}}}{(\phi (x_2)r_2^2 - \phi (x_1)r_1^2)\,{\textrm{d}}\beta _{t,t+h}}\Big | \\&\quad \le \int _{{\mathfrak {C}}\times {\mathfrak {C}}}{\int _0^1{|\varphi '_{y_1,y_2}(s)| \,{\textrm{d}}s}\,{\textrm{d}}\beta _{t,t+h}} \le \\&\quad \int _{{\mathfrak {C}}_{{\mathfrak {o}}}\times {\mathfrak {C}}}{\int _0^1{\Big |\langle \nabla \phi (x_1),\theta '_{[x_1,r_1],[x_2,r_2]}(s)(x_2-x_1)\rangle {\mathcal {R}}_{[x_1,r_1],[x_2,r_2]}(s) +2\phi (x_1){\mathcal {R}}'_{[x_1,r_1],[x_2,r_2]}(s)\Big | \,{\textrm{d}}s}\,{\textrm{d}}\beta _{t,t+h}}\\&\qquad + 2C_{\phi ,\Lambda , \Sigma }\textsf{HK}_{\Lambda , \Sigma }(\mu _{t},\mu _{t+h})^2 \\&\quad \le \Big (\int _{{\mathfrak {C}}_{\mathfrak {o}}}{\Big (\Lambda |\nabla \phi |^2 +\Sigma \phi ^2\Big )\,{\textrm{d}}(\pi ^1_{\#}\beta _{t,t+h})}\Big )^{1/2} Big(\int _{{\mathfrak {C}}_{\mathfrak {o}}\times {\mathfrak {C}}}{\int _0^1{\Big (\frac{1}{\Lambda }{\mathcal {R}}^2(\theta ')^2|x_2-x_1|^2 + \frac{4}{\Sigma }({\mathcal {R}}')^2\Big )\,{\textrm{d}}s}\,{\textrm{d}}\beta _{t,t+h}}\Big )^{1/2}\\&\qquad + 2C_{\phi ,\Lambda , \Sigma }\textsf{HK}_{\Lambda , \Sigma }(\mu _{t},\mu _{t+h})^2 \ \le \ ||(\nabla \phi , \phi )||_{{\textrm{L}}^{2}(\mu _t, {\mathbb {R}}^d\times {\mathbb {R}})} \textsf{HK}_{\Lambda , \Sigma }(\mu _t,\mu _{t+h}) + 2C_{\phi ,\Lambda , \Sigma }\textsf{HK}_{\Lambda , \Sigma }(\mu _{t},\mu _{t+h})^2 \end{aligned}$$

and thus,

$$\begin{aligned} \limsup _{h\rightarrow 0}\frac{1}{|h|}\Big |\int _{{\mathbb {R}}^d}{\phi \,{\textrm{d}}\mu _{t+h}} - \int _{{\mathbb {R}}^d}{\phi \,{\textrm{d}}\mu _t}\Big | \ \le \ ||(\nabla \phi , \phi )||_{{\textrm{L}}^{2}(\mu _t, {\mathbb {R}}^d\times {\mathbb {R}})}|\mu _t'|. \end{aligned}$$

(3.11)

At this point, we may follow the proof of Thm. 8.3.1 in [1]. Therein, a similar characterization of absolutely continuous curves in the space of Borel probability measures with finite second order moments, endowed with the Kantorovich-Wasserstein distance, was given by solving a suitable minimum problem. We adapt that approach. Let $\mu \in {\mathcal {M}}((0,1)\times {\mathbb {R}}^d)$ be defined by

$$\begin{aligned} \int _{(0,1)\times {\mathbb {R}}^d}{\psi (t,x)\,{\textrm{d}}\mu (t,x)} \ = \ \int _0^1{\int _{{\mathbb {R}}^d}{\psi (t,x)\,{\textrm{d}}\mu _t(x)}\,{\textrm{d}}t} \end{aligned}$$

for all $\psi \in \textrm{C}^{0}_b((0,1)\times {\mathbb {R}}^d)$, and let $({\textrm{L}}^{2}(\mu ,{\mathbb {R}}^d\times {\mathbb {R}}), ||\cdot ||_{{\textrm{L}}^{2}(\mu , {\mathbb {R}}^d\times {\mathbb {R}})})$ denote the normed space of all $\mu $-measurable vector fields $({\hat{v}},{\hat{w}})$ from $(0,1)\times {\mathbb {R}}^d$ to ${\mathbb {R}}^d\times {\mathbb {R}}$ satisfying

$$\begin{aligned} ||({\hat{v}},{\hat{w}})||_{{\textrm{L}}^{2}(\mu , {\mathbb {R}}^d\times {\mathbb {R}})} := \Big (\int _0^1{\int _{{\mathbb {R}}^d}{(\Lambda |{\hat{v}}_t|^2 + \Sigma |{\hat{w}}_t|^2)\,{\textrm{d}}\mu _t}\,{\textrm{d}}t}\Big )^{1/2} < +\infty .\nonumber \\ \end{aligned}$$

(3.12)

An application of (3.11), Fatou’s Lemma, Hölder’s inequality and Hahn-Banach Theorem shows that there exists a unique bounded linear functional L defined on the closure ${\mathcal {V}}$ in ${\textrm{L}}^{2}(\mu , {\mathbb {R}}^d\times {\mathbb {R}})$ of the subspace $\{(\nabla \zeta ,\zeta ): \ \zeta \in \textrm{C}^{\infty }_c((0,1)\times {\mathbb {R}}^d)\}$, satisfying

$$\begin{aligned} L((\nabla \zeta , \zeta )) := -\int _0^1{\int _{{\mathbb {R}}^d}{\partial _t\zeta (t,x)\,{\textrm{d}}\mu _t}\,{\textrm{d}}t} \quad \text { for all }\zeta \in \textrm{C}^{\infty }_c((0,1)\times {\mathbb {R}}^d).\nonumber \\ \end{aligned}$$

(3.13)

We consider the minimum problem

$$\begin{aligned} \min \Big \{\frac{1}{2}||({\hat{v}},{\hat{w}})||^2_{{\textrm{L}}^{2}(\mu , {\mathbb {R}}^d\times {\mathbb {R}})} - L(({\hat{v}},{\hat{w}})): \ ({\hat{v}},{\hat{w}})\in {\mathcal {V}}\Big \}. \end{aligned}$$

(3.14)

The same argument as in the proof of Thm. 8.3.1 in [1] proves that the unique solution $({\tilde{v}},{\tilde{w}})$ to (3.14) (which clearly exists) satisfies (3.3) and, for ${\mathscr {L}}^1$-a.e. $t\in (0,1)$, the function $({\tilde{v}}_t,{\tilde{w}}_t)$ belongs to the closure in ${\textrm{L}}^{2}(\mu _t, {\mathbb {R}}^d\times \mathbb {R})$ of the subspace $\{(\nabla \zeta , \zeta ): \ \zeta \in \textrm{C}^{\infty }_c({\mathbb {R}}^d)\}$ and (3.7) holds good. By Thm. 8.17 in [9], for every Borel vector field $({\hat{v}}, {\hat{w}})\in {\textrm{L}}^{2}(\mu ,{\mathbb {R}}^d\times {\mathbb {R}})$ satisfying the continuity equation with reaction (3.3) the opposite inequality holds good, i.e.

$$\begin{aligned} \int _{{\mathbb {R}}^d}{(\Lambda |{\hat{v}}_t|^2 + \Sigma |{\hat{w}}_t|^2)\,{\textrm{d}}\mu _t} \ \ge \ |\mu _t'|^2 \quad \text { for } {\mathscr {L}}^1\text {-a.e.} t\in (0,1). \end{aligned}$$

It follows from this and from the strict convexity of $||\cdot ||^2_{{\textrm{L}}^{2}(\mu _t, {\mathbb {R}}^d\times {\mathbb {R}})}$ that the Borel vector field $({\tilde{v}},{\tilde{w}})$ solves (3.3), (3.4) and that it coincides ${\mathscr {L}}^1$-a.e. with any other vector field solving (3.3), (3.4). This completes the proof of Prop. 3.1. $\square $

Definition 3.2

Let ${\mathcal {C}}({\mathbb {R}}^d)$ be a countable subset of $\textrm{C}^{\infty }_c({\mathbb {R}}^d)$ so that every function in $\textrm{C}^{\infty }_c({\mathbb {R}}^d)$ can be approximated in the ${\textrm{C}}^{1}$-norm by a sequence of functions in ${\mathcal {C}}({\mathbb {R}}^d)$.

We define ${\mathcal {N}}_{\mu }$ as the set of points $t\in (0,1)$ at which the following holds good:

(i)
The limit (3.1) exists,
(ii)
$(v_t,w_t)$ belongs to the closure in ${\textrm{L}}^{2}(\mu _t, {\mathbb {R}}^d\times \mathbb {R})$ of the subspace $\{(\nabla \zeta , \zeta ): \ \zeta \in \textrm{C}^{\infty }_c({\mathbb {R}}^d)\}$ and satisfies (3.4),
(iii)
and, for all $\psi \in {\mathcal {C}}({\mathbb {R}}^d)$,
$$\begin{aligned} \lim _{h\rightarrow 0}\frac{1}{h}\Big (\int _{{\mathbb {R}}^d}{\psi \,{\textrm{d}}\mu _{t+h}} - \int _{{\mathbb {R}}^d}{\psi \,{\textrm{d}}\mu _{t}}\Big ) = \int _{{\mathbb {R}}^d}{(\Lambda \langle \nabla \psi , v_t\rangle + \Sigma \psi w_t)\,{\textrm{d}}\mu _t}.\nonumber \\ \end{aligned}$$
(3.15)

Please note that $(0,1)\setminus {\mathcal {N}}_\mu $ is an ${\mathscr {L}}^1$-negligible set; it follows from (1.7) that, for fixed $\psi \in \textrm{C}^{\infty }_c({\mathbb {R}}^d)$, the mapping $t\mapsto \int _{{\mathbb {R}}^d}{\psi \,{\textrm{d}}\mu _t}$ is absolutely continuous from [0, 1] to ${\mathbb {R}}$ and (3.15) holds good at ${\mathscr {L}}^1$-a.e. $t\in (0,1)$.

The second step in our analysis is to establish a connection between the “tangent vector” $(v_t, w_t)$ to $\mu _t$ and tangent vectors to geodesics in $({\mathfrak {C}}, {\textsf {d}}_{{\mathfrak {C}}, \Lambda , \Sigma })$, measured by $\beta _{t,t+h}$ for |h| small. For $t\in {\mathcal {N}}_\mu , h\in (-t, 1-t)$ and $s\in (0,1)$, the mappings

$$\begin{aligned} {\mathfrak {D}}_{t,h,s}: (y_1,y_2){} & {} \mapsto \Big (({\textsf {x}}(y_1),{\textsf {r}}(y_1)), \Big (\frac{1}{h\Lambda }{\mathcal {R}}_{y_1,y_2}(s)\theta _{y_1,y_2}'(s)({\textsf {x}}(y_2)\nonumber \\{} & {} \quad -{\textsf {x}}(y_1)), \frac{2}{h\Sigma }{\mathcal {R}}_{y_1,y_2}'(s)\Big )\Big ) \end{aligned}$$

(3.16)

from $\Big ({\mathfrak {C}}\times {\mathfrak {C}}\Big )\setminus \Big \{([x_1, r_1], [x_2,r_2])\in {\mathfrak {C}}\times {\mathfrak {C}}: \ r_1, r_2> 0, \ |x_1 - x_2| > \pi \sqrt{\Lambda /\Sigma }\Big \}$ to $({\mathbb {R}}^d\times {\mathbb {R}})\times ({\mathbb {R}}^d\times {\mathbb {R}})$ will be considered, with ${\textsf {x}}$, ${\textsf {r}}$ as in (2.15), and ${\mathcal {R}}_{y_1,y_2}, \theta _{y_1,y_2}$ being constructed according to (2.6)–(2.9). Their second components may be interpreted as blow-ups of tangent vectors to geodesics in $({\mathfrak {C}}, {\textsf {d }}_{{\mathfrak {C}}, \Lambda , \Sigma })$; in fact, the transition from $({\textsf {x}}, {\textsf {r}})$ to the local chart $(1/\Lambda \ {\mathcal {R}}_{y_1,y_2}(s) \ {\textsf {x}}, 2/\Sigma \ {\textsf {r}})$ transforms the tangent vector ${\mathfrak {t}}_{y_1,y_2}(s)$ from (2.11) into the tangent vector

$$\begin{aligned} \tilde{{\mathfrak {t}}}_{y_1,y_2}(s):= \Big (\frac{1}{\Lambda }{\mathcal {R}}_{y_1,y_2}(s)\theta _{y_1,y_2}'(s)({\textsf {x}}(y_2)-{\textsf {x}}(y_1)), \frac{2}{\Sigma }{\mathcal {R}}_{y_1,y_2}'(s)\Big ). \end{aligned}$$

(3.17)

We will take advantage of the fact that this chart transition transforms the metric tensor ${\mathfrak {G}}^{\Lambda , \Sigma }$ from (2.5) into a metric tensor which is equal to $ \Lambda <{\mathfrak {v}}_1, {\mathfrak {w}}_1> + \Sigma \ {\mathfrak {v}}_2{\mathfrak {w}}_2$ for tangent vectors ${\mathfrak {v}}:=({\mathfrak {v}}_1, {\mathfrak {v}}_2), {\mathfrak {w}}:=({\mathfrak {w}}_1, {\mathfrak {w}}_2) \in {\mathbb {R}}^d\times {\mathbb {R}}$ at $[x,{\mathcal {R}}_{y_1,y_2}(s)]\in {\mathfrak {C}}$.

We turn to the push-forward $\Delta _{t,h,s}\in {\mathcal {M}}(({\mathbb {R}}^d\times {\mathbb {R}})\times ({\mathbb {R}}^d\times {\mathbb {R}}))$ of $\beta _{t,t+h}$ through (3.16), defined by

$$\begin{aligned} \int _{({\mathbb {R}}^d\times {\mathbb {R}})\times ({\mathbb {R}}^d\times {\mathbb {R}})}{\Phi (y)\,{\textrm{d}}\Delta _{t,h,s}}\quad \ = \ \quad \int _{{\mathfrak {C}}\times {\mathfrak {C}}}{\Phi ({\mathfrak {D}}_{t,h,s}(y_1,y_2))\,{\textrm{d}}\beta _{t,t+h}} \end{aligned}$$

for all $\Phi \in \textrm{C}^{0}_b(({\mathbb {R}}^d\times {\mathbb {R}})\times ({\mathbb {R}}^d\times {\mathbb {R}}))$. Please recall (2.20) in this context and note that, by (2.9), the mappings (3.16) are Borel measurable. The following proposition will provide information on the limits of $\Delta _{t,h,s}$ as $h\rightarrow 0$, linking them to $(v_t,w_t)$.

Proposition 3.3

Let $t\in {\mathcal {N}}_\mu $ and $s\in (0,1)$. Then

$$\begin{aligned} \lim _{h\rightarrow 0} \int _{({\mathbb {R}}^d\times {\mathbb {R}})\times ({\mathbb {R}}^d\times {\mathbb {R}})}{\Phi (y)\,{\textrm{d}}\Delta _{t,h,s}} \ = \ \int _{{\mathbb {R}}^d}{\Phi ((x,1),(v_t(x),w_t(x)))\,{\textrm{d}}\mu _t}\nonumber \\ \end{aligned}$$

(3.18)

for all continuous functions $\Phi : ({\mathbb {R}}^d\times {\mathbb {R}})\times ({\mathbb {R}}^d\times {\mathbb {R}})\rightarrow {\mathbb {R}}$ satisfying the growth condition

$$\begin{aligned} |\Phi ((x_1,r_1),(x_2,r_2))| \ \le \ C\Big (1+|x_2|^2+|r_2|^2\Big ) \end{aligned}$$

(3.19)

for some $C>0$.

Proof

We set $Y:={\mathbb {R}}^d\times {\mathbb {R}}$.

Let $t\in {\mathcal {N}}_\mu $ and $s\in (0,1)$. We note that, by (2.9) and Def. 3.2(i),

$$\begin{aligned}{} & {} \int _{Y\times Y}{(\Lambda |x_2|^2 + \Sigma |r_2|^2)\,{\textrm{d}}\Delta _{t,h,s}((x_1,r_1),(x_2,r_2))}\nonumber \\{} & {} \quad = \frac{\textsf{HK}_{\Lambda , \Sigma }(\mu _t,\mu _{t+h})^2}{h^2} \rightarrow |\mu _t'|^2 \quad \text { as } h\rightarrow 0. \end{aligned}$$

(3.20)

We may apply Prokhorov’s Theorem to any sequence $(\Delta _{t,h_k,s})_{k\in \mathbb {N}}, \ h_k\rightarrow 0,$ of measures from the family $(\Delta _{t,h,s})_{h\in (-t,1-t)}\subset {\mathcal {M}}(Y\times Y)$, since such sequence is bounded and equally tight by (3.5) and (3.20), and we obtain a subsequence $h_{k_l}\rightarrow 0$ and a measure $\Delta \in {\mathcal {M}}(Y\times Y)$ so that $(\Delta _{t,h_{k_l},s})_{l\in \mathbb {N}}$ converges to $\Delta $ in the weak topology on ${\mathcal {M}}(Y\times Y)$, in duality with continuous and bounded functions. So let $(\Delta _{t,h_l,s})_{l\in \mathbb {N}} \ (h_l\rightarrow 0)$ be a convergent sequence with limit measure $\Delta \in {\mathcal {M}}(Y\times Y)$, i.e.

$$\begin{aligned} \lim _{l\rightarrow \infty }\int _{Y\times Y}{\Phi (y)\,{\textrm{d}}\Delta _{t,h_l,s}} \ = \ \int _{Y\times Y}{\Phi (y)\,{\textrm{d}}\Delta } \end{aligned}$$

(3.21)

for all $\Phi \in \textrm{C}^{0}_b(Y\times Y)$. We want to identify $\Delta $ as $((x,1),(v_t(x),w_t(x)))_{\#}\mu _t$. It is not difficult to infer from (3.5) that the first marginal $\pi ^1_{\#}\Delta $ of $\Delta $ coincides with $(x,1)_{\#}\mu _t$, i.e.

$$\begin{aligned} \int _{Y}{\phi ((x,r))\,{\textrm{d}}(\pi ^1_{\#}\Delta )} = \int _{{\mathbb {R}}^d}{\phi ((x,1))\,{\textrm{d}}\mu _t} \end{aligned}$$

(3.22)

for all $\phi \in \textrm{C}^{0}_b(Y)$. Let $\psi \in {\mathcal {C}}({\mathbb {R}}^d)$. Then (3.21) also holds good for $\Phi ((x_1,r_1),(x_2,r_2)):=\Big [\Lambda \langle \nabla \psi (x_1),x_2\rangle + \Sigma \psi (x_1)r_2\Big ]r_1$: Indeed, we have

$$\begin{aligned} \lim _{l\rightarrow \infty }\int _{Y\times Y}{(\Phi _N) \,{\textrm{d}}\Delta _{t,h_l,s}} = \int _{Y\times Y}{(\Phi _N) \,{\textrm{d}}\Delta } \end{aligned}$$

for all $N > 0$, with $\Phi _N:= (\Phi \wedge N)\vee (-N)$. Setting $Y_N:=\{(x,r)\in Y: \ |x|+|r| > N\}, \ C_\psi :=\sup _{x\in {\mathbb {R}}^d}\{|\nabla \psi (x)|+|\psi (x)|\},$ and applying (3.5), (3.20) and (3.22), we conclude that for every $\epsilon > 0$ there exists $N_\epsilon > 0$ so that

$$\begin{aligned} \int _{Y\times Y_N}{(|x_2|+|r_2|)\,{\textrm{d}}\Delta _{t,h_l,s}} + \int _{Y\times Y_N}{(|x_2|+|r_2|)\,{\textrm{d}}\Delta } \ \le \epsilon \quad \text { for all } N\ge N_\epsilon , \ l\in \mathbb {N}, \end{aligned}$$

and

$$\begin{aligned}{} & {} \limsup _{l\rightarrow \infty } \Big |\int _{Y\times Y}{\Phi \,{\textrm{d}}\Delta _{t,h_l,s}} - \int _{Y\times Y}{\Phi \,{\textrm{d}}\Delta }\Big | \\{} & {} \quad \le \limsup _{l\rightarrow \infty } \Big |\int _{Y\times Y}{(\Phi _{C_\psi (\Lambda + \Sigma )N_\epsilon })\,{\textrm{d}}\Delta _{t,h_l,s}} - \int _{Y\times Y}{\Phi _{C_\psi (\Lambda + \Sigma )N_\epsilon } \,{\textrm{d}}\Delta }\Big | \\{} & {} \qquad + \ C_\psi (\Lambda +\Sigma )\limsup _{l\rightarrow \infty }\int _{Y\times Y_{N_\epsilon }}{(|x_2|+|r_2|)\,{\textrm{d}}(\Delta _{t,h_l,s}+\Delta )} \\{} & {} \quad \le C_\psi (\Lambda + \Sigma )\epsilon . \end{aligned}$$

Hence, taking (3.22) into account, we obtain

$$\begin{aligned}{} & {} \lim _{l\rightarrow \infty }\int _{Y\times Y}{\Big [\Lambda \langle \nabla \psi (x_1),x_2\rangle +\Sigma \psi (x_1)r_2\Big ]r_1\,{\textrm{d}}\Delta _{t,h_l,s}} \nonumber \\{} & {} \quad = \int _{Y\times Y}{\Big [\Lambda \langle \nabla \psi (x_1),x_2\rangle + \Sigma \psi (x_1)r_2\Big ] \,{\textrm{d}}\Delta }. \end{aligned}$$

(3.23)

It holds that

$$\begin{aligned}{} & {} \int _{{\mathbb {R}}^d}{\psi \,{\textrm{d}}\mu _{t+h_l}} - \int _{{\mathbb {R}}^d}{\psi \,{\textrm{d}}\mu _{t}} = \int _{{\mathfrak {C}}\times {\mathfrak {C}}}{(\psi (x_2)r_2^2 - \psi (x_1)r_1^2)\,{\textrm{d}}\beta _{t,t+h}} \\{} & {} \quad = \int _{{\mathfrak {C}}\times {\mathfrak {C}}}{\int _0^1{\frac{\,{\textrm{d}}}{\,{\textrm{d}}s}\Big [\psi (x_1+\theta _{[x_1,r_1],[x_2,r_2]}(s)(x_2-x_1)){\mathcal {R}}_{[x_1,r_1],[x_2,r_2]}(s)^2\Big ] \,{\textrm{d}}s}\,{\textrm{d}}\beta _{t,t+h_l}} \end{aligned}$$

so that (3.15), (3.8), (3.9), (3.10), Def. 3.2(i) and (3.23) yield

$$\begin{aligned}{} & {} \int _{{\mathbb {R}}^d}{(\Lambda \langle \nabla \psi , v_t\rangle + \Sigma \psi w_t)\,{\textrm{d}}\mu _t} = \lim _{l\rightarrow \infty }\frac{1}{h_l}\Big (\int _{{\mathbb {R}}^d}{\psi \,{\textrm{d}}\mu _{t+h_l}} - \int _{{\mathbb {R}}^d}{\psi \,{\textrm{d}}\mu _t}\Big ) \\{} & {} \quad = \lim _{l\rightarrow \infty }\int _{Y\times Y}{\Big [\Lambda \langle \nabla \psi (x_1),x_2\rangle + \Sigma \psi (x_1)r_2\Big ]r_1\,{\textrm{d}}\Delta _{t,h_l,s}} \\{} & {} \quad = \int _{Y\times Y}{\Big [\Lambda \langle \nabla \psi (x_1),x_2\rangle + \Sigma \psi (x_1)r_2\Big ] \,{\textrm{d}}\Delta }. \end{aligned}$$

According to the Disintegration Theorem (see e.g. Thm. 5.3.1 in [1]) and (3.22), there exists a Borel family of probability measures $(\Delta _{x_1})_{x_1\in {\mathbb {R}}^d}\subset {\mathcal {M}}(Y), \ \Delta _{x_1}(Y) = 1,$ so that

$$\begin{aligned} \int _{Y\times Y}{\Phi \,{\textrm{d}}\Delta } = \int _{{\mathbb {R}}^d}{\Big (\int _Y{\Phi ((x_1,1), (x_2,r_2))\,{\textrm{d}}\Delta _{x_1}((x_2,r_2))}\Big )\,{\textrm{d}}\mu _t(x_1)} \end{aligned}$$

for all $\Delta $-integrable maps $\Phi : Y\times Y \rightarrow {\mathbb {R}}$. We infer from (3.20) that, for $\mu _t$-a.e. $x_1\in {\mathbb {R}}^d$, the measure $\Delta _{x_1}$ has finite second order moment and we define the function $(v_\Delta , w_\Delta ): {\mathbb {R}}^d\rightarrow {\mathbb {R}}^d\times {\mathbb {R}}$ by

$$\begin{aligned} v_\Delta (x_1){} & {} := \int _Y{x_2\,{\textrm{d}}\Delta _{x_1}((x_2,r_2))}, \ w_{\Delta }(x_1):=\int _Y{r_2\,{\textrm{d}}\Delta _{x_1}((x_2,r_2))} \nonumber \\{} & {} \quad \text { for } \mu _t \text {-a.e. } x_1\in {\mathbb {R}}^d. \end{aligned}$$

(3.24)

The function $(v_\Delta , w_\Delta )$ is Borel measurable (cf. (5.3.1) and Def. 5.4.2 in [1]), and

$$\begin{aligned}{} & {} \int _{Y\times Y}{\Big [\Lambda \langle \nabla \psi (x_1),x_2\rangle + \Sigma \psi (x_1)r_2\Big ] \,{\textrm{d}}\Delta } \\{} & {} \quad = \int _{{\mathbb {R}}^d}{\Big (\int _Y{\Big [\Lambda \langle \nabla \psi (x_1),x_2\rangle + \Sigma \psi (x_1)r_2\Big ] \,{\textrm{d}}\Delta _{x_1}((x_2,r_2))}\Big )\,{\textrm{d}}\mu _t(x_1)} \\{} & {} \quad = \int _{{\mathbb {R}}^d}{(\Lambda \langle \nabla \psi , v_\Delta \rangle + \Sigma \psi w_\Delta )\,{\textrm{d}}\mu _t}. \end{aligned}$$

All in all, we have found that

$$\begin{aligned} \int _{{\mathbb {R}}^d}{(\Lambda \langle \nabla \psi , v_t\rangle + \Sigma \psi w_t)\,{\textrm{d}}\mu _t} = \int _{{\mathbb {R}}^d}{(\Lambda \langle \nabla \psi , v_\Delta \rangle + \Sigma \psi w_\Delta )\,{\textrm{d}}\mu _t} \end{aligned}$$

(3.25)

for all $\psi \in {\mathcal {C}}({\mathbb {R}}^d)$. Since every function in $\textrm{C}^{\infty }_c({\mathbb {R}}^d)$ can be approximated in the ${\textrm{C}}^{1}$-norm by a sequence of functions in ${\mathcal {C}}({\mathbb {R}}^d)$ (cf. Def. 3.2) and, by (3.20) and Def. 3.2(ii), the functions $v_\Delta , w_\Delta , v_t, w_t$ are square-integrable w.r.t. $\mu _t$, (3.25) holds good for all $\psi \in \textrm{C}^{\infty }_c({\mathbb {R}}^d)$ and for all pairs in the ${\textrm{L}}^{2}(\mu _t, {\mathbb {R}}^d\times {\mathbb {R}})$-closure of $\{(\nabla \zeta ,\zeta ): \ \zeta \in \textrm{C}^{\infty }_c({\mathbb {R}}^d)\}$. It follows from this and from Def. 3.2(ii) that

$$\begin{aligned} ||(v_t,w_t)||_{{\textrm{L}}^{2}(\mu _t, {\mathbb {R}}^d\times {\mathbb {R}})}^2 \ = \int _{{\mathbb {R}}^d}{(\Lambda \langle v_t, v_\Delta \rangle + \Sigma w_t w_\Delta )\,{\textrm{d}}\mu _t}. \end{aligned}$$

(3.26)

Applying Hölder’s inequality to (3.26), taking the definition (3.24) of $v_\Delta , w_\Delta $, Jensen’s inequality, (3.21), (3.20) and Def. 3.2(ii) into account, we obtain

$$\begin{aligned}{} & {} ||(v_t,w_t)||_{{\textrm{L}}^{2}(\mu _t, {\mathbb {R}}^d\times {\mathbb {R}})} \le ||(v_\Delta ,w_\Delta )||_{{\textrm{L}}^{2}(\mu _t, {\mathbb {R}}^d\times {\mathbb {R}})} \nonumber \\{} & {} \quad \le \Big (\int _{Y\times Y}{(\Lambda |x_2|^2 + \Sigma |r_2|^2)\,{\textrm{d}}\Delta }\Big )^{1/2} \le \end{aligned}$$

(3.27)

$$\begin{aligned}{} & {} \quad \le \lim _{l\rightarrow \infty }\Big (\int _{Y\times Y}{(\Lambda |x_2|^2 + \Sigma |r_2|^2)\,{\textrm{d}}\Delta _{t,h_l,s}}\Big )^{1/2}\nonumber \\{} & {} \quad =||(v_t,w_t)||_{{\textrm{L}}^{2}(\mu _t, {\mathbb {R}}^d\times {\mathbb {R}})} \end{aligned}$$

(3.28)

so that, in fact, equality holds good everywhere in (3.27) and (3.28). We infer from this and from (3.26) that

$$\begin{aligned} ||(v_t,w_t) - (v_\Delta , w_\Delta )||_{{\textrm{L}}^{2}(\mu _t, {\mathbb {R}}^d\times {\mathbb {R}})} = 0 \end{aligned}$$

which means

$$\begin{aligned} v_t(x) = v_\Delta (x) \text { and } w_t(x) = w_\Delta (x) \quad \quad \text { for } \mu _t\text {-a.e. } x\in {\mathbb {R}}^d. \end{aligned}$$

(3.29)

Moreover, the fact that the second inequality in (3.27), resulting from Jensen’s inequality, is in fact an equality and (3.29) yield $\Delta _{x_1} = \delta _{v_t(x_1)}\otimes \delta _{w_t(x_1)}$ for $\mu _t$-a.e. $x_1\in {\mathbb {R}}^d$ (cf. a canonical proof of Jensen’s inequality), i.e.

$$\begin{aligned} \int _Y{\phi ((x,r))\,{\textrm{d}}\Delta _{x_1}} = \phi (v_t(x_1), w_t(x_1)) \end{aligned}$$

(3.30)

for all $\phi \in \textrm{C}^{0}_b(Y)$, for $\mu _t$-a.e. $x_1\in {\mathbb {R}}^d$.

Altogether, we may conclude that $\Delta =((x,1),(v_t(x),w_t(x)))_{\#}\mu _t$,

$$\begin{aligned}{} & {} \int _{Y\times Y}{(\Lambda |x_2|^2+\Sigma |r_2|^2)\,{\textrm{d}}\Delta } = |\mu _t'|^2 \nonumber \\{} & {} \quad = \lim _{l\rightarrow \infty }\int _{Y\times Y}{(\Lambda |x_2|^2+\Sigma |r_2|^2)\,{\textrm{d}}\Delta _{t,h_l,s}} \end{aligned}$$

(3.31)

and that (3.18) holds good for all $\Phi \in \textrm{C}^{0}_b(Y\times Y)$. A similar argument as in the proof of (3.23), making use of (3.31), will show (3.18) for all continuous functions $\Phi : Y\times Y \rightarrow {\mathbb {R}}$ satisfying the growth condition (3.19) (cf. Thm. 7.12 in [10] where the space of Borel probability measures with finite second order moments is considered and the equivalence between convergence in the Kantorovich-Wasserstein distance and convergence in duality with continuous functions satisfying a suitable growth condition is proved). This completes the proof of Prop. 3.3. $\square $

Now, Theorem 3.4 yields a linearization result for absolutely continuous curves.

Theorem 3.4

Let $t\in {\mathcal {N}}_\mu $.

Define ${\mathfrak {C}}_{t,h}:=\Big \{[x,r]\in {\mathfrak {C}}\setminus \{{\mathfrak {o}}\}: \ |v_t(x)|< \frac{1}{\sqrt{|h|}} \text { and } |w_t(x)| < \frac{2}{\sqrt{|h|}\Sigma }\Big \}$ and $\Xi _{t,h}:{\mathfrak {C}}\rightarrow {\mathfrak {C}}$,

$$\begin{aligned} \Xi _{t,h}([x,r]):= {\left\{ \begin{array}{ll} [x+\Lambda hv_t(x), r(1+\frac{\Sigma }{2}hw_t(x))] \text { if } [x, r]\in {\mathfrak {C}}_{t,h}, \\ {[}x,r{]} \text { else }. \end{array}\right. } \end{aligned}$$

(3.32)

Let $\chi _{t,h}:=(\Xi _{t,h})_{\#}(\pi ^1_{\#}\beta _{t,t+h})$ be the push-forward of the first marginal of $\beta _{t,t+h}$ through $\Xi _{t,h}$, i.e.

$$\begin{aligned} \int _{{\mathfrak {C}}}{\phi ([x,r])\,{\textrm{d}}\chi _{t,h}} = \int _{{\mathfrak {C}}}{\phi (\Xi _{t,h}([x,r]))\,{\textrm{d}}(\pi ^1_{\#}\beta _{t,t+h})} \end{aligned}$$

for all $\phi \in \textrm{C}^{0}_b({\mathfrak {C}})$. Then

$$\begin{aligned} \lim _{h\rightarrow 0} \frac{\textsf{HK}_{\Lambda , \Sigma }(\mu _{t+h}, {\mathfrak {h}}\chi _{t,h})^2}{h^2} \ = \ 0. \end{aligned}$$

(3.33)

Remark 3.5

The technical role of ${\mathfrak {C}}_{t,h}$ will be visible in the proof. First, the restriction to ${\mathfrak {C}}_{t,h}$ ensures that $[x+\Lambda hv_t(x), r(1+\frac{\Sigma }{2}hw_t(x))]\in {\mathfrak {C}}$ is well-defined, and second, we will take advantage thereof in order to suitably estimate ${\textsf {d}}_{{\mathfrak {C}},\Lambda , \Sigma }(\Xi _{t,h}(y_1), y_2)^2/h^2$ for $(y_1,y_2)\in \text {supp }\beta _{t,t+h}$.

Proof

We set $Y:={\mathbb {R}}^d\times {\mathbb {R}}$.

Let $t\in {\mathcal {N}}_\mu $. According to (2.13), (2.16), we have

$$\begin{aligned} \frac{\textsf{HK}_{\Lambda , \Sigma }(\mu _{t+h},{\mathfrak {h}}\chi _{t,h})^2}{h^2} \ \le \ \frac{1}{h^2}\int _{{\mathfrak {C}}\times {\mathfrak {C}}}{{\textsf {d}}_{{\mathfrak {C}}, \Lambda , \Sigma }(\Xi _{t,h}([x_1,r_1]), [x_2,r_2])^2 \,{\textrm{d}}\beta _{t,t+h}}.\nonumber \\ \end{aligned}$$

(3.34)

We will prove that the right-hand side of (3.34) converges to 0 as $h\rightarrow 0$.

First we note that, by Prokhorov’s Theorem, Def. 3.2(ii) and the proof of Prop. 3.3, every sequence $\Big (((v_t(x_1),w_t(x_1)), (x_2,r_2))_{\#}\Delta _{t,h_l,s}\Big )_{l\in \mathbb {N}}, \ h_l\rightarrow 0,$ is relatively compact w.r.t. the weak topology in ${\mathcal {M}}(Y\times Y)$ and in duality with continuous functions $\Phi : Y\times Y\rightarrow {\mathbb {R}}$ satisfying (3.19), and the second marginals of the corresponding limit measures coincide with $(v_t(x), w_t(x))_{\#}\mu _t$. It follows therefrom that for $N\in \mathbb {N}, \ {\bar{s}}\in (0,1)$,

$$\begin{aligned}{} & {} \limsup _{h\rightarrow 0}\frac{1}{h^2}\int _{({\mathfrak {C}}\setminus {\mathfrak {C}}_{t,1/N})\times {\mathfrak {C}}}{{\textsf {d}}_{{\mathfrak {C}},\Lambda , \Sigma }([x_1,r_1],[x_2,r_2])^2\,{\textrm{d}}\beta _{t,t+h}} \\{} & {} \quad = \ \limsup _{h\rightarrow 0} \int _{Y\times Y}{(\Lambda |x_2|^2+\Sigma |r_2|^2)\mathbbm {1}_{\{x: |v_t(x)| \ge \sqrt{N} \text { or } |w_t(x)|\ge 2\sqrt{N}/\Sigma \}}(x_1)\,{\textrm{d}}\Delta _{t,h,{\bar{s}}}} \\{} & {} \quad \le \ \int _{Y\times Y}{(\Lambda |x_2|^2+\Sigma |r_2|^2)\mathbbm {1}_{\{(x,r): |x| \ge \sqrt{N} \text { or } |r|\ge 2\sqrt{N}/\Sigma \}}(x_1,r_1)\,{\textrm{d}}{\tilde{\Delta }}}, \end{aligned}$$

(where ${\tilde{\Delta }}$ denotes a suitable limit measure of $((v_t(x_1),w_t(x_1)), (x_2,r_2))_{\#}\Delta _{t,h,{\bar{s}}}$) and an application of the Dominated Convergence Theorem then yields

$$\begin{aligned} \lim _{N\rightarrow \infty }\limsup _{h\rightarrow 0}\frac{1}{h^2}\int _{({\mathfrak {C}}\setminus {\mathfrak {C}}_{t,1/N})\times {\mathfrak {C}}}{{\textsf {d}}_{{\mathfrak {C}},\Lambda , \Sigma }([x_1,r_1],[x_2,r_2])^2\,{\textrm{d}}\beta _{t,t+h}} \ = \ 0, \end{aligned}$$

which implies

$$\begin{aligned} \lim _{h\rightarrow 0} \ \frac{1}{h^2}\int _{({\mathfrak {C}}\setminus {\mathfrak {C}}_{t,h})\times {\mathfrak {C}}}{{\textsf {d}}_{{\mathfrak {C}},\Lambda , \Sigma }([x_1,r_1],[x_2,r_2])^2\,{\textrm{d}}\beta _{t,t+h}} \ = \ 0. \end{aligned}$$

(3.35)

Next we consider $\frac{1}{h^2}\int _{{\mathfrak {C}}_{t,h}\times {\mathfrak {C}}}{{\textsf {d}}_{{\mathfrak {C}}, \Lambda , \Sigma }(\Xi _{t,h}([x_1,r_1]), [x_2,r_2])^2 \,{\textrm{d}}\beta _{t,t+h}}$. According to ([2], Sect. 3.6) and ([9], Sect. 8.1), the geometric cone $({\mathfrak {C}},{\textsf {d}}_{{\mathfrak {C}},\Lambda , \Sigma })$ is a length space and it holds that any curve $\eta :=[x,r]: [0,1]\rightarrow {\mathfrak {C}}$ for ${\textrm{C}}^{1}$-functions $x: [0,1]\rightarrow {\mathbb {R}}^d$ and $r:[0,1]\rightarrow [0,+\infty )$ is absolutely continuous in $({\mathfrak {C}},{\textsf {d}}_{{\mathfrak {C}},\Lambda ,\Sigma })$ and

$$\begin{aligned} {\textsf {d}}_{{\mathfrak {C}},\Lambda , \Sigma }(\eta (1), \eta (0))^2\le \int _0^1{\Big (\frac{4}{\Sigma }(r'(s))^2+\frac{1}{\Lambda } r(s)^2|x'(s)|^2\Big )\,{\textrm{d}}s} \end{aligned}$$

(cf. ([9], Lem. 8.1)). We define, for $y_1:=[x_1,r_1]\in {\mathfrak {C}}_{t,h}$, $y_2:=[x_2,r_2]\in {\mathfrak {C}}$, with $|x_1-x_2|\le \pi \sqrt{\Lambda /\Sigma }$ if $r_2 >0$, an absolutely continuous curve ${\mathcal {A}}_{h,\Xi (y_1),y_2}: [0,1]\rightarrow {\mathfrak {C}}$ connecting $\Xi (y_1)=[x_1+\Lambda hv_t(x_1), r_1(1+ \Sigma h w_t(x_1)/2)]$ and $y_2$ by setting ${\mathcal {A}}_{h,\Xi (y_1),y_2}:= [{\mathcal {X}}_{h,\Xi (y_1),y_2},{\mathcal {R}}_{h,\Xi (y_1),y_2}]$,

$$\begin{aligned} {\mathcal {X}}_{h,\Xi (y_1),y_2}(s):= & {} x_1 + \theta _{y_1,y_2}(s)(x_2-x_1) + \Lambda (1-s)h v_t(x_1),\end{aligned}$$

(3.36)

$$\begin{aligned} {\mathcal {R}}_{h, \Xi (y_1),y_2}(s):= & {} {\mathcal {R}}_{y_1,y_2}(s)\Big (1+\Sigma (1-s)h w_t(x_1)/2\Big ) \end{aligned}$$

(3.37)

(cf. (2.6)-(2.9), (2.20)). The functions ${\mathcal {X}}_{h,\Xi (y_1),y_2}: [0,1]\rightarrow {\mathbb {R}}^d$ and ${\mathcal {R}}_{h,\Xi (y_1),y_2}: [0,1]\rightarrow [0,+\infty )$ are continuously differentiable with

$$\begin{aligned}{} & {} ({\mathcal {R}}_{h,\Xi (y_1),y_2}'(s))^2\\{} & {} \quad = \Big (\Sigma {\mathcal {R}}'_{y_1,y_2}(s)(1-s)h w_t(x_1)/2 + {\mathcal {R}}_{y_1,y_2}'(s) - \Sigma {\mathcal {R}}_{y_1,y_2}(s) hw_t(x_1)/2\Big )^2 \\{} & {} \quad \le 2|h| \Sigma \ {\textsf {d}}_{{\mathfrak {C}},\Lambda , \Sigma }(y_1,y_2)^2 + 2\Big ({\mathcal {R}}'_{y_1,y_2}(s)-\Sigma r_1 hw_t(x_1)/2\Big )^2 \end{aligned}$$

and

$$\begin{aligned}{} & {} {\mathcal {R}}_{h,\Xi (y_1),y_2}(s)^2|{\mathcal {X}}_{h,\Xi (y_1),y_2}'(s)|^2\\{} & {} \quad \le 4{\mathcal {R}}_{y_1,y_2}(s)^2|\theta '_{y_1,y_2}(s)(x_2-x_1) - \Lambda hv_t(x_1)|^2 \\{} & {} \quad \le 8\Big (|{\mathcal {R}}_{y_1,y_2}(s)\theta '_{y_1,y_2}(s)(x_2 - x_1)- \Lambda r_1 h v_t(x_1)|^2 + \Lambda ^2 |h| |{\mathcal {R}}_{y_1,y_2}(s)-r_1|^2\Big ) \\{} & {} \quad \le 8\Big (|{\mathcal {R}}_{y_1,y_2}(s)\theta '_{y_1,y_2}(s)(x_2 - x_1)- \Lambda r_1 h v_t(x_1)|^2 + \Lambda ^2 \Sigma |h| / 4 \ {\textsf {d}}_{{\mathfrak {C}},\Lambda , \Sigma }(y_1,y_2)^2\Big ), \end{aligned}$$

where we have made use of (2.9) and the fact that $y_1=[x_1,r_1]\in {\mathfrak {C}}_{t,h}$. It follows from the above estimations and an application of Fubini’s Theorem that

$$\begin{aligned}{} & {} \frac{1}{h^2}\int _{{\mathfrak {C}}_{t,h}\times {\mathfrak {C}}}{{\textsf {d}}_{{\mathfrak {C}}, \Lambda , \Sigma }(\Xi _{t,h}([x_1,r_1]), [x_2,r_2])^2 \,{\textrm{d}}\beta _{t,t+h}} \\{} & {} \quad \le \frac{1}{h^2}\int _{{\mathfrak {C}}_{t,h}\times {\mathfrak {C}}}{\int _0^1{\Big (\frac{4}{\Sigma }({\mathcal {R}}'_{h,\Xi (y_1),y_2}(s))^2+\frac{1}{\Lambda }{\mathcal {R}}_{h,\Xi (y_1),y_2}(s)^2|{\mathcal {X}}_{h,\Xi (y_1),y_2}'(s)|^2\Big )\,{\textrm{d}}s}\,{\textrm{d}}\beta _{t,t+h}} \\{} & {} \quad \le \int _0^1{\int _{Y\times Y}{\Big (2\Sigma (r_2-r_1 w_t(x_1))^2 + 8\Lambda |x_2-r_1 v_t(x_1)|^2\Big )\,{\textrm{d}}\Delta _{t,h,s}((x_1,r_1),(x_2,r_2))}\,{\textrm{d}}s} \\{} & {} \qquad + \ C_{\Lambda , \Sigma } \frac{\textsf{HK}_{\Lambda , \Sigma }(\mu _t,\mu _{t+h})^2}{|h|} \end{aligned}$$

with $C_{\Lambda , \Sigma }$ only depending on $\Lambda $ and $\Sigma $. According to Def. 3.2(ii), there exists a sequence of functions $\zeta _n\in \textrm{C}^{\infty }_c({\mathbb {R}}^d) \ (n\in \mathbb {N})$ so that $((\nabla \zeta _n, \zeta _n))_{n\in \mathbb {N}}$ converges to $(v_t,w_t)$ in ${\textrm{L}}^{2}(\mu _t, {\mathbb {R}}^d\times {\mathbb {R}})$, which means

$$\begin{aligned}{} & {} \lim _{n\rightarrow \infty } \ \int _{Y\times Y}\Big (r_1^2 (\zeta _n(x_1)-w_t(x_1))^2 \nonumber \\{} & {} \quad + r_1^2|\nabla \zeta _n(x_1)-v_t(x_1)|^2\Big )\,{\textrm{d}}\Delta _{t,h,s}((x_1,r_1),(x_2,r_2)) = 0 \end{aligned}$$

(3.38)

uniformly in $h\in (-t,1-t)$ and $s\in (0,1)$. Moreover, Prop. 3.3 and (3.5) yield

$$\begin{aligned}{} & {} \lim _{h\rightarrow 0} \ \int _{Y\times Y}{\Big (\Sigma (r_2-r_1 \zeta _n(x_1))^2 + \Lambda |x_2-r_1 \nabla \zeta _n(x_1)|^2\Big )\,{\textrm{d}}\Delta _{t,h,s}} \nonumber \\{} & {} \quad = ||(v_t,w_t)-(\nabla \zeta _n,\zeta _n)||_{{\textrm{L}}^{2}(\mu _t,{\mathbb {R}}^d\times {\mathbb {R}})}^2 \end{aligned}$$

(3.39)

for all $n\in \mathbb {N}$ and $s\in (0,1)$. Combining (3.38) and (3.39) and the fact that the right-hand side of (3.39) converges to 0 as $n\rightarrow \infty $, we obtain

$$\begin{aligned}{} & {} \limsup _{h\rightarrow 0}\int _{Y\times Y}\Big (2\Sigma (r_2-r_1 w_t(x_1))^2 \\{} & {} \quad + 8\Lambda |x_2-r_1 v_t(x_1)|^2\Big )\,{\textrm{d}}\Delta _{t,h,s}((x_1,r_1),(x_2,r_2)) \ = \ 0, \end{aligned}$$

for every $s\in (0,1)$, and thus, by Fatou’s lemma,

$$\begin{aligned}{} & {} \limsup _{h\rightarrow 0}\int _0^1\int _{Y\times Y}\Big (2\Sigma (r_2-r_1 w_t(x_1))^2\nonumber \\{} & {} \quad + 8\Lambda |x_2-r_1 v_t(x_1)|^2\Big )\,{\textrm{d}}\Delta _{t,h,s}((x_1,r_1),(x_2,r_2)) \,{\textrm{d}}s \ = \ 0. \end{aligned}$$

(3.40)

Finally, applying the above estimation of $\frac{1}{h^2}\int _{{\mathfrak {C}}_{t,h}\times {\mathfrak {C}}}{\textsf {d}}_{{\mathfrak {C}}, \Lambda , \Sigma }(\Xi _{t,h}([x_1,r_1]), [x_2,r_2])^2 \,{\textrm{d}}\beta _{t,t+h}$, (3.40) and Def. 3.2(i), we obtain

$$\begin{aligned} \lim _{h\rightarrow 0} \ \frac{1}{h^2}\int _{{\mathfrak {C}}_{t,h}\times {\mathfrak {C}}}{{\textsf {d}}_{{\mathfrak {C}}, \Lambda , \Sigma }(\Xi _{t,h}([x_1,r_1]), [x_2,r_2])^2 \,{\textrm{d}}\beta _{t,t+h}} \ = \ 0, \end{aligned}$$

(3.41)

which completes the proof of Thm. 3.4. $\square $

4 Differentiability Results

This section finally treats the differentiability of the Hellinger–Kantorovich distance $\textsf{HK}_{\Lambda , \Sigma }$ along absolutely continuous curves; the linearization result of Thm. 3.4 puts us in a position to precisely compute the corresponding derivatives.

We fix another absolutely continuous curve $(\nu _t)_{t\in [0,1]}$ in $({\mathcal {M}}({\mathbb {R}}^d), \textsf{HK}_{\Lambda , \Sigma })$ with square-integrable metric derivative $t\mapsto |\nu _t'|$. It follows from (3.2) that

$$\begin{aligned} t\mapsto \frac{1}{2} \textsf{HK}_{\Lambda , \Sigma }(\mu _t, \nu _t)^2 \end{aligned}$$

(4.1)

is an absolutely continuous mapping from [0, 1] to $[0, +\infty )$ and thus ${\mathscr {L}}^1$-a.e. differentiable.

Let $({\bar{v}},{\bar{w}}): (0,1)\times {\mathbb {R}}^d \rightarrow {\mathbb {R}}^d\times \mathbb {R}$ be the essentially unique Borel vector field associated with $(\nu _t)_t$ so that the continuity equation with reaction

$$\begin{aligned} \partial _t\nu _t = -\Lambda \textrm{div}({\bar{v}}_t\nu _t) + \Sigma {\bar{w}}_t\nu _t \end{aligned}$$

holds good and

$$\begin{aligned} \int _{{\mathbb {R}}^d}{(\Lambda |{\bar{v}}_t|^2 + \Sigma |{\bar{w}}_t|^2)\,{\textrm{d}}\nu _t} \ = \ |\nu _t'|^2 \quad \text { for } {\mathscr {L}}^1\text {-a.e. } t\in (0,1), \end{aligned}$$

let ${\mathcal {N}}_\nu $ be the associated set of times defined according to Def. 3.2 and let ${\mathcal {N}}$ denote the set of times $t\in {\mathcal {N}}_\mu \cap {\mathcal {N}}_\nu $ at which (4.1) is differentiable. Clearly, $(0,1)\setminus {\mathcal {N}}$ is an ${\mathscr {L}}^1$-negligible set.

Theorem 4.1

If $t\in {\mathcal {N}}$ and $\beta _{t}\in {\mathcal {M}}({\mathfrak {C}}\times {\mathfrak {C}})$ is optimal in the definition of $\textsf{HK}_{\Lambda , \Sigma }(\mu _t, \nu _t)^2$ according to ((2.17), (2.13)), i.e.

$$\begin{aligned}{} & {} {\hat{\mu }}_t:= \mu _t-{\mathfrak {h}}(\pi ^1_{\#}\beta _{t}) \ge 0, \quad {\hat{\nu }}_t:= \nu _t-{\mathfrak {h}}(\pi ^2_{\#}\beta _{t})\ge 0, \\{} & {} \textsf{HK}_{\Lambda , \Sigma }(\mu _t, \nu _{t})^2 \ = \ \int _{{\mathfrak {C}}\times {\mathfrak {C}}}{{\textsf {d}}_{{\mathfrak {C}}, \Lambda , \Sigma }([x_1,r_1],[x_2,r_2])^2\,{\textrm{d}}\beta _{t}} \\{} & {} \qquad \qquad \qquad \qquad +4/\Sigma \ {\hat{\mu }}_t({\mathbb {R}}^d) \ + \ 4/\Sigma \ {\hat{\nu }}_t({\mathbb {R}}^d), \end{aligned}$$

then the derivative $\frac{\,{\textrm{d}}}{\,{\textrm{d}}t} [\frac{1}{2}\textsf{HK}_{\Lambda , \Sigma }(\mu _t, \nu _t)^2]$ of (4.1) at t coincides with

$$\begin{aligned}{} & {} -\int _{{\mathfrak {C}}\times {\mathfrak {C}}}[{\mathfrak {G}}_{y_1}^{\Lambda , \Sigma }({\mathfrak {t}}_{y_1,y_2}(0), {\mathfrak {s}}^\mu _{t, y_1}) + {\mathfrak {G}}_{y_2}^{\Lambda , \Sigma }({\mathfrak {t}}_{y_2,y_1}(0), {\mathfrak {s}}^\nu _{t, y_2})]\,{\textrm{d}}\beta _{t} \nonumber \\{} & {} \quad + 2\Big (\int _{{\mathbb {R}}^d}{w_t \,{\textrm{d}}{\hat{\mu }}_t} + \int _{{\mathbb {R}}^d}{{\bar{w}}_t \,{\textrm{d}}{\hat{\nu }}_t}\Big ) \end{aligned}$$

(4.2)

where ${\mathfrak {s}}^\mu _{t,y}:= (\Lambda v_t({\textsf {x}}(y)), \Sigma /2 \ {\textsf {r}}(y) w_t({\textsf {x}}(y)))$ and ${\mathfrak {s}}^\nu _{t,y}:= (\Lambda {\bar{v}}_t({\textsf {x}}(y)), \Sigma /2 {\textsf {r}}(y) {\bar{w}}_t({\textsf {x}}(y)))$.

Before proving Thm. 4.1, let us try to gain an insight into the above formula (4.2).

Remark 4.2

Suppose that $\nu _s\equiv \nu \in {\mathcal {M}}({\mathbb {R}}^d)$. There exists an optimal plan $\beta _{t}$ associated with $\mu _t$ and $\nu $ whose marginals satisfy $\mu _t = {\mathfrak {h}}(\pi ^1_{\#}\beta _{t})$ and $\nu = {\mathfrak {h}}(\pi ^2_{\#}\beta _{t})$ (cf. Thm. 7.6 in [9]). The derivative $\frac{\,{\textrm{d}}}{\,{\textrm{d}}t} [\frac{1}{2}\textsf{HK}_{\Lambda , \Sigma }(\mu _t, \nu )^2]$ at $t\in {\mathcal {N}}$ then takes the form

$$\begin{aligned} -\int _{{\mathfrak {C}}\times {\mathfrak {C}}}{{\mathfrak {G}}_{y_1}^{\Lambda , \Sigma }({\mathfrak {t}}_{y_1,y_2}(0), {\mathfrak {s}}^\mu _{t, y_1})\,{\textrm{d}}\beta _t}. \end{aligned}$$

(4.3)

The tangent vectors ${\mathfrak {t}}_{y_1,y_2}(0)$ (see (2.12)) and ${\mathfrak {s}}^\mu _{t,y_1}$ to the geometric cone ${\mathfrak {C}}$, for $(y_1, y_2)\in \text {supp } \beta _t$, represent the directions $\mu _t\rightsquigarrow \nu $ and $\mu _t\rightsquigarrow \mu _{t+h}$ (for $h>0$ small) respectively on an infinitesimal level (cf. Thm. 3.4). It is noteworthy that the metric tensor ${\mathfrak {G}}^{\Lambda , \Sigma }$ (see (2.5)) at $y_1\in {\mathfrak {C}}$ between such tangent vectors ${\mathfrak {t}}_{y_1,y_2}(0)$ and ${\mathfrak {s}}^\mu _{t, y_1}$ is equal to the derivative at $h=0$ of $h\mapsto -1/2 \ {\textsf {d}}_{{\mathfrak {C}},\Lambda , \Sigma }(\Xi _{t,h}(y_1), y_2)^2$ (see (3.32)), i.e.

$$\begin{aligned} -{\mathfrak {G}}_{y_1}^{\Lambda , \Sigma }({\mathfrak {t}}_{y_1,y_2}(0), {\mathfrak {s}}^\mu _{t, y_1}) \ = \ \frac{1}{2} \frac{{\textsf {d}}}{{\textsf {d}}h}\bigg \vert _{h=0} \ {\textsf {d}}_{{\mathfrak {C}},\Lambda , \Sigma }(\Xi _{t,h}(y_1), y_2)^2 \end{aligned}$$

(4.4)

for a simple computation shows that both terms in (4.4) equal

$$\begin{aligned}{} & {} 2 r_1^2 w_t(x_1)-2r_1r_2 w_t(x_1)\cos (\sqrt{\Sigma /4\Lambda }|x_1-x_2|)\\{} & {} \quad -2r_1r_2\sqrt{\Lambda /\Sigma } \ \Big \langle \frac{\sin (\sqrt{\Sigma /4\Lambda }|x_1-x_2|)}{|x_1-x_2|}(x_2-x_1) , v_t(x_1)\Big \rangle \end{aligned}$$

($y_i=[x_i, r_i]\in {\mathfrak {C}}$).

Also, we would like to remark that the derivatives of (4.1) at $t\in {\mathcal {N}}$ can be expressed equally in terms of the Logarithmic Entropy-Transport characterization (1.1) of the Hellinger–Kantorovich distance $\textsf{HK}_{\Lambda , \Sigma }$, by applying (2.19) to the above representation (4.2) of the derivatives.

Proof

Let $t\in {\mathcal {N}}$. Then $t\in {\mathcal {N}}_\mu \cap {\mathcal {N}}_\nu $ and (4.1) is differentiable at t. We apply Thm. 3.4 to both curves $(\mu _s)_s$ and $(\nu _s)_s$ defining $\Xi _{\mu , t, h}, \ \chi _{\mu , t,h}$ and $\Xi _{\nu , t, h}, \ \chi _{\nu , t, h}$ respectively according thereto so that, by the corresponding linearization results,

$$\begin{aligned}{} & {} \frac{\,{\textrm{d}}}{\,{\textrm{d}}s} \Big [\frac{1}{2}\textsf{HK}_{\Lambda , \Sigma }(\mu _s, \nu _s)^2\Big ]\Bigg \vert _{s=t} \ \nonumber \\{} & {} \quad = \ \lim _{h\rightarrow 0}\frac{\frac{1}{2}\textsf{HK}_{\Lambda , \Sigma }({\mathfrak {h}}\chi _{\mu , t,h}, {\mathfrak {h}}\chi _{\nu , t, h})^2 - \frac{1}{2}\textsf{HK}_{\Lambda , \Sigma }(\mu _t, \nu _t)^2}{h} \end{aligned}$$

(4.5)

(cf. (3.32), (3.33)). Let ${\bar{\chi }}_{\mu ,t,h}:=(\Xi _{\mu ,t,h})_{\#}\alpha _{\mu ,t}$ and ${\bar{\chi }}_{\nu ,t,h}:=(\Xi _{\nu ,t,h})_{\#}\alpha _{\nu ,t}$ be the push-forwards of the marginals $\alpha _{\mu ,t}:=\pi ^1_{\#}\beta _t$ and $\alpha _{\nu ,t}:=\pi ^2_{\#}\beta _t$ of $\beta _t$ through the mappings $\Xi _{\mu ,t,h}$ and $\Xi _{\nu , t, h}$ respectively. We have

$$\begin{aligned} \int _{{\mathbb {R}}^d}{\phi \,{\textrm{d}}({\mathfrak {h}}{\bar{\chi }}_{\mu , t,h})}{} & {} = \int _{\mathfrak {C}_{\mu ,t,h}}{{\textsf {r}}^2(1+\Sigma hw_t({\textsf {x}})/2)^2\phi ({\textsf {x}}+\Lambda hv_t({\textsf {x}}))\,{\textrm{d}}\alpha _{\mu ,t}}\\{} & {} \quad + \int _{{\mathfrak {C}}\setminus {\mathfrak {C}}_{\mu ,t,h}}{{\textsf {r}}^2\phi ({\textsf {x}})\,{\textrm{d}}\alpha _{\mu ,t}}\\{} & {} = \int _{{\textsf {x}}({\mathfrak {C}}_{\mu ,t,h})}{(1+\Sigma hw_t(x)/2)^2\phi (x+\Lambda hv_t(x))\,{\textrm{d}}{\mathfrak {h}}\alpha _{\mu ,t}}\\{} & {} \quad + \int _{{\textsf {x}}({\mathfrak {C}}\setminus {\mathfrak {C}}_{\mu ,t,h})}{\phi (x)\,{\textrm{d}}{\mathfrak {h}}\alpha _{\mu ,t}} \\{} & {} \le \int _{{\textsf {x}}({\mathfrak {C}}_{\mu ,t,h})}{(1+\Sigma hw_t(x)/2)^2\phi (x+\Lambda hv_t(x))\,{\textrm{d}}\mu _t} \\{} & {} \quad + \int _{{\textsf {x}}({\mathfrak {C}}\setminus {\mathfrak {C}}_{\mu ,t,h})}{\phi (x)\,{\textrm{d}}\mu _t} \ = \int _{{\mathbb {R}}^d}{\phi \,{\textrm{d}}({\mathfrak {h}}\chi _{\mu ,t,h})} \end{aligned}$$

for all nonnegative bounded Borel functions $\phi : {\mathbb {R}}^d\rightarrow {\mathbb {R}}$ (cf. (2.14), (2.15)), from which we infer that

$$\begin{aligned} {\mathfrak {h}}{\bar{\chi }}_{\mu ,t,h}{} & {} \le {\mathfrak {h}}\chi _{\mu ,t,h}, ({\mathfrak {h}}\chi _{\mu ,t,h} - {\mathfrak {h}}{\bar{\chi }}_{\mu ,t,h})({\mathbb {R}}^d) = {\hat{\mu }}_t({\mathbb {R}}^d) \ + \int _{{\textsf {x}}({\mathfrak {C}}_{\mu ,t,h})}{\Big (\Sigma hw_t(x)+\frac{\Sigma ^2}{4}h^2 w_t(x)^2\Big )\,{\textrm{d}}{\hat{\mu }}_t} . \end{aligned}$$

Similarly,

$$\begin{aligned} {\mathfrak {h}}{\bar{\chi }}_{\nu ,t,h}\le & {} {\mathfrak {h}}\chi _{\nu ,t,h}, \\ ({\mathfrak {h}}\chi _{\nu ,t,h} - {\mathfrak {h}}{\bar{\chi }}_{\nu ,t,h})({\mathbb {R}}^d)= & {} {\hat{\nu }}_t({\mathbb {R}}^d) \ + \int _{{\textsf {x}}({\mathfrak {C}}_{\nu ,t,h})}{\Big (\Sigma h{\bar{w}}_t(x)+\frac{\Sigma ^2}{4}h^2 {\bar{w}}_t(x)^2\Big )\,{\textrm{d}}{\hat{\nu }}_t} . \end{aligned}$$

We obtain

$$\begin{aligned}{} & {} \frac{1}{2}\Big (\textsf{HK}_{\Lambda , \Sigma }({\mathfrak {h}}\chi _{\mu ,t,h}, {\mathfrak {h}}\chi _{\nu , t, h})^2 - \textsf{HK}_{\Lambda , \Sigma }(\mu _t, \nu _t)^2\Big ) \\{} & {} \quad \le \frac{1}{2}\Big ({\mathcal {W}}_{{\mathfrak {C}}, \Lambda , \Sigma }({\bar{\chi }}_{\mu ,t,h}, {\bar{\chi }}_{\nu ,t,h})^2 - {\mathcal {W}}_{{\mathfrak {C}}, \Lambda , \Sigma }(\alpha _{\mu ,t}, \alpha _{\nu ,t})^2\Big ) \\{} & {} \qquad + 2 \int _{{\textsf {x}}({\mathfrak {C}}_{\mu ,t,h})}{\Big (hw_t(x)+\frac{\Sigma }{4}h^2 w_t(x)^2\Big )\,{\textrm{d}}{\hat{\mu }}_t} \\{} & {} \qquad + 2 \int _{{\textsf {x}}({\mathfrak {C}}_{\nu ,t,h})}{\Big (h{\bar{w}}_t(x)+\frac{\Sigma }{4}h^2 {\bar{w}}_t(x)^2\Big )\,{\textrm{d}}{\hat{\nu }}_t}, \end{aligned}$$

and

$$\begin{aligned} {\mathcal {W}}_{{\mathfrak {C}}, \Lambda , \Sigma }({\bar{\chi }}_{\mu ,t,h}, {\bar{\chi }}_{\nu ,t,h})^2 \ \le \ \int _{{\mathfrak {C}}\times {\mathfrak {C}}}{{\textsf {d}}_{{\mathfrak {C}}, \Lambda , \Sigma }(\Xi _{\mu , t, h}([x_1,r_1]), \Xi _{\nu , t, h}([x_2,r_2]))^2\,{\textrm{d}}\beta _t}. \end{aligned}$$

The same argument as in the proof of Lem. 2.2 in [5] then yields

$$\begin{aligned}{} & {} \limsup _{h\downarrow 0}\frac{\frac{1}{2}{\mathcal {W}}_{{\mathfrak {C}}, \Lambda , \Sigma }({\bar{\chi }}_{\mu ,t,h}, {\bar{\chi }}_{\nu , t, h})^2 - \frac{1}{2}{\mathcal {W}}_{{\mathfrak {C}}, \Lambda , \Sigma }(\alpha _{\mu , t}, \alpha _{\nu ,t})^2}{h} \\{} & {} \quad \le 2\int _{{\mathfrak {C}}\times {\mathfrak {C}}}\Big [r_1^2 w_t(x_1)-r_1r_2 w_t(x_1)\cos (\sqrt{\Sigma /4\Lambda }|x_1-x_2|)\\{} & {} \quad -r_1r_2\sqrt{\Lambda /\Sigma } \ \langle S_{\Lambda ,\Sigma }(x_1,x_2), v_t(x_1)\rangle \Big ]\,{\textrm{d}}\beta _{t} \\{} & {} \quad + \ 2 \int _{{\mathfrak {C}}\times {\mathfrak {C}}}\Big [r_2^2 {\bar{w}}_t(x_2)-r_1r_2 {\bar{w}}_t(x_2)\cos (\sqrt{\Sigma /4\Lambda }|x_1-x_2|)\\{} & {} \quad +r_1r_2\sqrt{\Lambda /\Sigma } \ \langle S_{\Lambda ,\Sigma }(x_1,x_2), {\bar{v}}_t(x_2)\rangle \Big ]\,{\textrm{d}}\beta _{t} \\{} & {} \quad \le \liminf _{h\uparrow 0}\frac{\frac{1}{2}{\mathcal {W}}_{{\mathfrak {C}}, \Lambda , \Sigma }({\bar{\chi }}_{\mu ,t,h}, {\bar{\chi }}_{\nu , t, h})^2 - \frac{1}{2}{\mathcal {W}}_{{\mathfrak {C}}, \Lambda , \Sigma }(\alpha _{\mu ,t}, \alpha _{\nu ,t})^2}{h}, \end{aligned}$$

with $S_{\Lambda , \Sigma }$ defined as

$$\begin{aligned} S_{\Lambda , \Sigma }(x_1, x_2):= {\left\{ \begin{array}{ll} \frac{\sin (\sqrt{\Sigma /4\Lambda }|x_1-x_2|)}{|x_1-x_2|}(x_2-x_1) &{}\text { if } x_1\ne x_2, \\ 0 &{}\text { if } x_1=x_2. \end{array}\right. } \end{aligned}$$

Since the limit (4.5) exists, the sum of the above integrands is identical with

$$\begin{aligned} -{\mathfrak {G}}_{y_1}^{\Lambda , \Sigma }({\mathfrak {t}}_{y_1,y_2}(0), {\mathfrak {s}}^\mu _{t, y_1}) - {\mathfrak {G}}_{y_2}^{\Lambda , \Sigma }({\mathfrak {t}}_{y_2,y_1}(0), {\mathfrak {s}}^\nu _{t, y_2}) \quad \quad \quad (y_i:=[x_i,r_i]) \end{aligned}$$

(cf. Rem. 4.2), and

$$\begin{aligned}{} & {} \lim _{h\rightarrow 0} \int _{{\textsf {x}}({\mathfrak {C}}_{\mu , t,h})}{\Big (w_t(x)+\frac{\Sigma }{4}h w_t(x)^2\Big )\,{\textrm{d}}{\hat{\mu }}_t} = \ \int _{{\mathbb {R}}^d}{w_t(x)\,{\textrm{d}}{\hat{\mu }}_t}, \\{} & {} \quad \lim _{h\rightarrow 0} \int _{{\textsf {x}}({\mathfrak {C}}_{\nu , t,h})}{\Big ({\bar{w}}_t(x)+\frac{\Sigma }{4}h {\bar{w}}_t(x)^2\Big )\,{\textrm{d}}{\hat{\nu }}_t} \ = \ \int _{{\mathbb {R}}^d}{{\bar{w}}_t(x)\,{\textrm{d}}{\hat{\nu }}_t}, \end{aligned}$$

it follows from the above computations that

$$\begin{aligned}{} & {} \lim _{h\rightarrow 0}\frac{\frac{1}{2}\textsf{HK}_{\Lambda , \Sigma }({\mathfrak {h}}\chi _{\mu ,t,h}, {\mathfrak {h}}\chi _{\nu ,t,h})^2 - \frac{1}{2}\textsf{HK}_{\Lambda , \Sigma }(\mu _t, \nu _t)^2}{h} \quad \\{} & {} \quad =-\int _{{\mathfrak {C}}\times {\mathfrak {C}}}{[{\mathfrak {G}}_{y_1}^{\Lambda , \Sigma }({\mathfrak {t}}_{y_1,y_2}(0), {\mathfrak {s}}^\mu _{t, y_1}) + {\mathfrak {G}}_{y_2}^{\Lambda , \Sigma }({\mathfrak {t}}_{y_2,y_1}(0), {\mathfrak {s}}^\nu _{t, y_2})]\,{\textrm{d}}\beta _{t}} \ \\{} & {} \qquad + \ 2\Big (\int _{{\mathbb {R}}^d}{w_t \,{\textrm{d}}{\hat{\mu }}_t} + \int _{{\mathbb {R}}^d}{{\bar{w}}_t \,{\textrm{d}}{\hat{\nu }}_t}\Big ). \end{aligned}$$

The proof of Thm. 4.1 is complete. $\square $

Notes

according to Lem. 2.3 in [9], there exist Borel functions $\rho _i: {\mathbb {R}}^d\rightarrow [0, +\infty )$ and nonnegative finite Radon measures $\mu _i^\bot \in {\mathcal {M}}({\mathbb {R}}^d), \ \mu _i^\bot \bot \gamma _i,$ so that (2.18) holds good.

References

Ambrosio, L., Gigli, N., Savaré, G.: Gradient Flows in Metric Spaces and in the Space of Probability Measures. Lectures in Mathematics. ETH Zürich, Birkhäuser (2005)
MATH Google Scholar
Burago, D., Burago, Y., Ivanov, S.: A Course in Metric Geometry. Graduate Studies in Mathematics, vol. 33. American Mathematical Society, Providence, RI (2001)
MATH Google Scholar
Chizat, L., Peyré, G., Schmitzer, B., Vialard, F.-X.: An interpolating distance between optimal transport and Fisher-Rao metrics. Found. Comput. Math. 18, 1–44 (2018)
Article MathSciNet MATH Google Scholar
Chizat, L., Peyré, G., Schmitzer, B., Vialard, F.-X.: Unbalanced optimal transport: dynamic and Kantorovich formulations. J. Funct. Anal. 274, 3090–3123 (2018)
Article MathSciNet MATH Google Scholar
Fleißner, F.: A Minimizing Movement approach to a class of scalar reaction-diffusion equations, ESAIM:COCV 27 (2021) 18. https://arXiv.org/2002.04496 (2020)
Gangbo, W., McCann, R.J.: The geometry of optimal transportation. Acta Math. 177, 113–161 (1996)
Article MathSciNet MATH Google Scholar
Kondratyev, S., Monsaingeon, L., Vorotnikov, D., et al.: A new optimal transport distance on the space of finite Radon measures. Adv. Differ. Equ. 21, 1117–1164 (2016)
MathSciNet MATH Google Scholar
Liero, M., Mielke, A., Savaré, G.: Optimal transport in competition with reaction: the Hellinger-Kantorovich distance and geodesic curves. SIAM J. Math. Anal. 48, 2869–2911 (2016)
Article MathSciNet MATH Google Scholar
Liero, M., Mielke, A., Savaré, G.: Optimal entropy-transport problems and a new Hellinger-Kantorovich distance between positive measures. Invent. Math. 211, 969–1117 (2018)
Article MathSciNet MATH Google Scholar
Villani, C.: Topics in Optimal Transportation. Graduate Studies in Mathematics, vol. 58. American Mathematical Society, Providence, RI (2003)
MATH Google Scholar

Download references

Acknowledgements

I gratefully acknowledge support from the Erwin Schrödinger International Institute for Mathematics and Physics (Vienna) during my participation in the programme “Optimal Transport”.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Technische Universität München, Munich, Germany
Florentine Catharina Fleißner

Authors

Florentine Catharina Fleißner
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Florentine Catharina Fleißner.

Ethics declarations

Conflict of interest

The authors have not disclosed any competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Fleißner, F.C. A Note on the Differentiability of the Hellinger–Kantorovich Distances. Appl Math Optim 87, 37 (2023). https://doi.org/10.1007/s00245-022-09948-y

Download citation

Accepted: 24 November 2022
Published: 13 March 2023
DOI: https://doi.org/10.1007/s00245-022-09948-y

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

A Note on the Differentiability of the Hellinger–Kantorovich Distances

Abstract

Similar content being viewed by others

Variability regions for the second derivative of bounded analytic functions

On stepanov Type differentiability Theorems

The continuity equation on metric measure spaces

1 Introduction

2 Optimal Transportation on the Cone

3 Absolutely Continuous Curves

Proposition 3.1

Proof

Definition 3.2

Proposition 3.3

Proof

Theorem 3.4

Remark 3.5

Proof

4 Differentiability Results

Theorem 4.1

Remark 4.2

Proof

Notes

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Navigation

A Note on the Differentiability of the Hellinger–Kantorovich Distances

Abstract

Similar content being viewed by others

Variability regions for the second derivative of bounded analytic functions

On stepanov Type differentiability Theorems

The continuity equation on metric measure spaces

1 Introduction

2 Optimal Transportation on the Cone

3 Absolutely Continuous Curves

Proposition 3.1

Proof

Definition 3.2

Proposition 3.3

Proof

Theorem 3.4

Remark 3.5

Proof

4 Differentiability Results

Theorem 4.1

Remark 4.2

Proof

Notes

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation