Tropical diagrams of probability spaces

Matveev, R.; Portegies, J. W.

doi:10.1007/s41884-020-00027-1

Tropical diagrams of probability spaces

Research Paper
Open access
Published: 07 April 2020

Volume 3, pages 61–88, (2020)
Cite this article

Download PDF

You have full access to this open access article

Information Geometry Aims and scope Submit manuscript

Tropical diagrams of probability spaces

Download PDF

3001 Accesses
3 Citations
8 Altmetric
Explore all metrics

Abstract

After endowing the space of diagrams of probability spaces with an entropy distance, we study its large-scale geometry by identifying the asymptotic cone as a closed convex cone in a Banach space. We call this cone the tropical cone, and its elements tropical diagrams of probability spaces. Given that the tropical cone has a rich structure, while tropical diagrams are rather flexible objects, we expect the theory of tropical diagrams to be useful for information optimization problems in information theory and artificial intelligence. In a companion article, we give a first application to derive a statement about the entropic cone.

Tropical Ehrhart theory and tropical volume

Article Open access 21 September 2020

Information geometry

Article 02 January 2021

Main Directions in the Theory of Probability Metrics

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

With [9] we started a research program aiming for a systematic approach to a class of information optimization problems in information theory and artificial intelligence. A prototypical example of such a problem, still wide open, is the characterization of the entropic cone: For an N-tuple of random variables, one may evaluate their entropies and the entropies of the joint variables and obtain a vector in $\mathbb {R}^{2^N - 1}$. A vector obtained in this way is called an entropy vector of an N-tuple of random variables. The closure of the set of all entropy vectors of N-tuples is what we call the entropic cone, see also [8]. Besides the characterization of the entropic cone, other information optimization problems arise for instance in causal inference [13], artificial intelligence [14], information decomposition [3], robotics [1], neuroscience [5] and in variational autoencoders [7].

The global strategy of our program is roughly based on the following way of thinking. The entropic cone is clearly a very complicated object: to date, there is no explicit description of the entropic cone for four or more random variables, while it is known that it is not polyhedral [8]. Yet, perhaps, much of its complexity may be explained by it being the closure of an image under a linear map of another, simpler, higher-dimensional cone.

The purpose of this article is to construct such a higher-dimensional (infinite-dimensional, in fact) object, which we call the tropical cone and to derive some of its properties which are testimony to its simple structure and which help the study of information optimization problems. As an example of its use, in [11] we apply the theory to derive a statement about the entropic cone.

Before outlining the construction of the tropical cone, let us mention that for our purposes, the language of random variables proved inconvenient, which is why work with diagrams of probability spaces instead.

Diagrams of probability spaces are commutative diagrams in the category of probability spaces, with (equivalence classes of) measure-preserving maps as morphisms, such as

(1.1)

Collections of n random variables give rise to a special type of diagrams, that include, besides the target spaces of the random variables themselves, the target space of every joint variable. Such diagrams have a particular combinatorial type. The first and the last diagrams in (1.1) are examples of such special types of diagrams in case of two and three random variables respectively. The description of other diagrams, such as the diagram in the middle of (1.1), using the language of random variables is less transparent.

We will construct the tropical cone and derive its properties over several sections. In Sect. 2 we describe the construction of the asymptotic cone in the abstract setting of a metric Abelian monoid $(\Gamma , +, \mathbf{d})$. We believe that this abstract setting will make the construction more transparent and easier to follow. The results we present in that section are probably quite standard, but we find it beneficial to gather them under one roof. Such an asymptotic cone consists of equivalence classes of quasi-linear sequences in the monoid. Whereas linear sequences have the form $(n\cdot a)_{n\in {\mathbb {N}}_{0}}$, where a is an element of the monoid, quasi-linear sequences may deviate from linearity in a controlled fashion, measured by a sublinear function $\varphi $ satisfying some additional conditions that we will specify later. A sequence $\gamma \in \Gamma ^{{\mathbb {N}}_0}$ is called $\varphi $-quasi-linear if for all $m, n \in \mathbb {N}$, it satisfies

$$\begin{aligned} \mathbf{d}\big (\gamma (m + n), \gamma (m) + \gamma (n)\big ) \le \varphi (m + n) \end{aligned}$$

and two sequences $\gamma $ and $\gamma '$ are equivalent if

$$\begin{aligned} \lim _{n {\mathop {\rightarrow }\limits ^{}}\infty } \frac{1}{n} \mathbf{d}\big (\gamma (n), \gamma '(n)\big ) = 0 \end{aligned}$$

The asymptotic cone is itself again a metric Abelian monoid, but it admits additional structure. It admits a distributive action of $\mathbb {R}_{\ge 0}$ and the metric becomes homogeneous and translation invariant. As an example of this construction, A’Campo [2] constructed the real numbers as the asymptotic cone in the monoid of integers.

In Sect. 3 we show that, under certain conditions, the asymptotic cone is a complete metric space and it can be realized as a closed convex cone in a Banach space.

In Sect. 4 we apply the general construction of Sects. 2 and 3 to the monoid of diagrams of probability spaces endowed with the intrinsic entropy distance [6, 9, 15] and with the tensor product as the binary operation. We call the resulting space tropical cone and its elements tropical diagrams.^{Footnote 1} In Sect. 6, we give a simple characterization of the tropical cone for special types of diagrams.

For more complicated diagrams, we currently do not have an explicit description of the tropical cone, but we do show that it possesses a rich algebraic structure. In particular, one can take convex combinations of tropical diagrams. Other useful operations and constructions can be carried through for tropical diagrams, whereas they do not have an equivalent in the classical context of probability spaces, see [10]. All in all, from some perspective, tropical diagrams are easier to deal with than diagrams or probability spaces, since only rough, asymptotic relations between probability spaces are preserved under tropicalization, similar to how all complicated features of the landscape disappear when looking at the Earth from outer space.

In order to study information optimization problems, we may as well study the more malleable tropical cone. This is because the entropic cone is the closure of the image of the bounded linear map defined on the tropical cone that evaluates entropies of the individual spaces in a tropical diagram. More generally, we call any non-negative bounded linear functional on the tropical cone an entropic quantity. These include entropies of individual spaces, but also some other quantities, such as optima of some linear combinations of entropies of an extended diagram, where some extra spaces are added to the original diagram. Study of such entropic quantities is the subject of our future research.

One of the main tools in the study of entropic quantities through the tropical cone is the Asymptotic Equipartition Property for diagrams. Originally derived in [9], we cast it here into a density statement of simpler, so-called homogeneous tropical diagrams in the tropical cone, in terms of Theorems 5.1 and 5.2. Therefore, to prove statements about entropic quantities, it suffices to study the much simpler homogeneous tropical diagrams.

2 Asymptotic cones of metric abelian monoids

In this section we define the asymptotic cone in the setting of an abstract metric Abelian monoid. In a later section, we will specify to the case of diagrams of probability spaces.

2.1 Metric and pseudo-metric spaces

A pseudo-metric space $(X,\mathbf{d})$ is a set X equipped with a pseudo-distance $\mathbf{d}$, a bivariate function satisfying all the axioms of a distance function, except that it is allowed to vanish on pairs of non-identical points. An isometry of such spaces is a distance-preserving map, such that for any point in the target space there is a point in the image at zero distance away from it. Given such a pseudo-metric space $(X,\mathbf{d})$ one could always construct an isometric metric space $(X/_{\mathbf{d}=0}\,,\mathbf{d})$, the metric quotient, by identifying all pairs of points that are distance zero apart.

Any property formulated in terms of the pseudo-metric holds simultaneously for a pseudo-metric space and its metric quotient. It will be convenient for us to construct pseudo-metrics on spaces instead of passing to the quotient spaces.

For a pair of points $x,y\in X$ in a pseudo-metric space $(X,\mathbf{d})$ we will write $x{\mathop {=}\limits ^{\mathbf{d}}} y$ if $\mathbf{d}(x,y)=0$. We call such a pair of points ($\mathbf{d}$-)metrically equivalent.

Many metric-topological notions such as (Lipschitz-)continuity, compactness, $\epsilon $-nets, dense subsets, etc., extend to the setting of a pseudo-metric spaces and exercising certain care one may switch between a pseudo-metric space and its metric quotient replacing the ${\mathop {=}\limits ^{\mathbf{d}}}$-sign with equality.

2.2 Metric abelian monoids

A monoid is a set equipped with a bivariate associative operation and a neutral element. The operation is usually called multiplication, or addition if it is commutative. We call a monoid with pseudo-distance $(\Gamma ,+,\mathbf{d})$ a metric Abelian monoid if it satisfies:

1.
For all $\gamma ,\gamma '\in \Gamma $ holds
$$\begin{aligned} \gamma + \gamma ' {\mathop {=}\limits ^{\mathbf{d}}} \gamma ' + \gamma \end{aligned}$$
2.
The binary operation is 1-Lipschitz with respect to each argument: For all $\gamma ,\gamma ',\eta \in \Gamma $
$$\begin{aligned} \mathbf{d}(\eta +\gamma ,\eta +\gamma ') \le \mathbf{d}(\gamma ,\gamma ') \end{aligned}$$
In other words, the translation maps
$$\begin{aligned} T_{\eta }:\Gamma {\mathop {\rightarrow }\limits ^{}}\Gamma , \quad \gamma \mapsto \eta +\gamma \end{aligned}$$
are non-expanding for every $\eta \in \Gamma $.

The following proposition is an elementary consequence of the triangle inequality.

Proposition 2.1

Let $(\Gamma ,+,\mathbf{d})$ be a metric Abelian monoid. Then:

1.
For any quadruple $\gamma _{1},\gamma _{2},\gamma _{3},\gamma _{4}\in \Gamma $ holds
$$\begin{aligned} \mathbf{d}(\gamma _{1}+\gamma _{2},\gamma _{3}+\gamma _{4}) \le \mathbf{d}(\gamma _{1},\gamma _{3}) + \mathbf{d}(\gamma _{2},\gamma _{4}) \end{aligned}$$
2.
For every $n \in {\mathbb {N}}$, and $\gamma _1, \gamma _2 \in \Gamma $ also holds
$$\begin{aligned} \mathbf{d}(n\cdot \gamma _1, n\cdot \gamma _2 ) \le n\cdot \mathbf{d}(\gamma _1, \gamma _2) \end{aligned}$$

A metric Abelian monoid $(\Gamma , +, \varvec{\delta })$ will be called homogeneous if it satisfies for all $n \in \mathbb {N}_0$

$$\begin{aligned} \varvec{\delta }(n\cdot \gamma _1, n\cdot \gamma _2) = n\cdot \varvec{\delta }(\gamma _1, \gamma _2) \end{aligned}$$

(2.1)

A homogeneous metric Abelian monoid is called an $\mathbb {R}_{\ge 0}$-semi-module$(\Gamma ,+,\cdot \,,\varvec{\delta })$ if in addition there is a doubly distributive $\mathbb {R}_{\ge 0}$-action such that for any $\lambda _{1},\lambda _{2}\in \mathbb {R}_{\ge 0}$ and $\gamma _{1},\gamma _{2}\in \Gamma $ holds

$$\begin{aligned} \lambda _{1}\cdot (\lambda _{2}\cdot \gamma _{1})&{\mathop {=}\limits ^{\varvec{\delta }}} (\lambda _{1}\lambda _{2})\cdot \gamma _{1} \\ \lambda \cdot (\gamma _{1}+\gamma _{2})&{\mathop {=}\limits ^{\varvec{\delta }}} \lambda \cdot \gamma _{1}+\lambda \cdot \gamma _{2} \\ (\lambda +\lambda ')\cdot \gamma _{1}&{\mathop {=}\limits ^{\varvec{\delta }}} \lambda \cdot \gamma _{1} + \lambda '\cdot \gamma _{1}\\ \varvec{\delta }(\lambda \cdot \gamma ,\lambda \cdot \gamma ')&= \lambda \cdot \varvec{\delta }(\gamma ,\gamma ') \end{aligned}$$

A convex cone in a normed vector space would be a typical example of an $\mathbb {R}_{\ge 0}$-semimodule. An intersection of a convex cone in $\mathbb {R}^{n}$ with the integer lattice is an example of a monoid, that does not admit semimodule structure.

The following proposition asserts that if a metric Abelian monoid is homogeneous, then the pseudo-distance is translation invariant, and, in particular, it satisfies a cancellation property. This result was communicated to us by Tobias Fritz.

Proposition 2.2

Let $(\Gamma ,+,\varvec{\delta })$ be a homogeneous metric Abelian monoid. Then the pseudo-distance function $\varvec{\delta }$ is translation invariant, that is it satisfies for any $\gamma _{1},\gamma _{2},\eta \in \Gamma $

$$\begin{aligned} \varvec{\delta }(\gamma _{1}+\eta ,\gamma _{2}+\eta ) = \varvec{\delta }(\gamma _{1},\gamma _{2}) \end{aligned}$$

In particular, the following cancellation property holds in $\Gamma $

If $\gamma _{1}+\eta {\mathop {=}\limits ^{\varvec{\delta }}} \gamma _{2}+\eta $, then $\gamma _{1}{\mathop {=}\limits ^{\varvec{\delta }}}\gamma _{2}$.

The proof of this proposition is essentially the same as the proof of [9, Proposition 3.7]. Even though the latter proposition is formulated for a specific homogeneous metric Abelian monoid, it does not use any of its specific properties, but only defining properties of a generic homogeneous metric Abelian monoid.

2.3 Asymptotic cones (tropicalization) of monoids

In our construction points of the asymptotic cone of $(\Gamma ,+,\mathbf{d})$ will be sequences of points in $\Gamma $ that grow almost linearly in a certain sense described below.

2.3.1 Admissible functions

Admissible functions will be used to measure the deviation of a sequence from being linear. We call a function $\varphi :\mathbb {R}_{\ge 1}{\mathop {\rightarrow }\limits ^{}}\mathbb {R}_{\ge 0}$admissible if

1.
the function $\varphi $ is non-decreasing;
2.
the function $\varphi (t)/t$ is non-increasing;
3.
there exists a constant $D_{\varphi }\ge 0$ such that $s\cdot \int _{s}^\infty \frac{\varphi (t)}{t^2} {\mathrm{d}}t \le \frac{D_{\varphi }}{8}\cdot \varphi (s)$ for any $s\ge 1$. In particular the function $\varphi $ is summable against ${\mathrm{d}}t/t^{2}$.

For example, the function $\varphi (t)\!\!{:}= t^{\alpha }$ is admissible for any $0\le \alpha <1$. Any admissible function is necessarily sub-linear, that is $\varphi (t)/t{\mathop {\rightarrow }\limits ^{}}0$ as $t{\mathop {\rightarrow }\limits ^{}}\infty $. A linear combination of admissible functions with non-negative coefficients is also admissible.

Lemma 2.3

Let $\varphi $ be a positive admissible function. Then for any $\alpha \ge 0$ and $\lambda \ge 1$ there is $C>1$ such that for any $t\ge 1$

$$\begin{aligned} \varphi (\lambda \cdot t)+\alpha \le C\cdot \varphi (t) \end{aligned}$$

Proof

From positivity and monotonicity of $\varphi $ we have

$$\begin{aligned} \alpha = \frac{\alpha }{\varphi (1)}\cdot \varphi (1) \le \frac{\alpha }{\varphi (1)}\cdot \varphi (t) \end{aligned}$$

On the other hand from monotonicity of the function $\varphi (t)/t$ it follows that for any $\lambda ,t\ge 1$

$$\begin{aligned} \varphi (\lambda \cdot t)\le \lambda \cdot \varphi (t) \end{aligned}$$

Adding the two inequalities above we obtain the conclusion of the lemma. $\square $

2.3.2 Quasi-linear sequences

Let $(\Gamma ,+,\mathbf{d})$ be a metric Abelian monoid and $\varphi $ be an admissible function. A sequence ${\bar{\gamma }}=\left\{ \gamma (i)\right\} \in \Gamma ^{{\mathbb {N}}_{0}}$ will be called quasi-linear with defect bounded by $\varphi $ if for every $m, n \in {\mathbb {N}}$ the following bound is satisfied

$$\begin{aligned} \mathbf{d}\big ( \gamma (m+n), \gamma (m) + \gamma (n) \big ) \le \varphi (m + n) \end{aligned}$$

(2.2)

For technical reasons we also require $\gamma (0)=0$. Sequences that are quasi-linear with defect bounded by $\varphi \equiv 0$ will be called linear sequences.

We will often need the following corollary of quasi-linearity, which follows from applying the bound (2.2) twice and using the monotonicity of $\varphi $: for all $m,n,k\in {\mathbb {N}}$

$$\begin{aligned} \mathbf{d}\big ( \gamma (m+n+k), \gamma (m) + \gamma (n) + \gamma (k) \big ) \le 2\varphi (m + n+k) \end{aligned}$$

(2.3)

For an admissible function $\varphi $ we will write $\textsf {Q}\textsf {L}_\varphi (\Gamma , \mathbf{d})$ for the space of all quasi-linear sequences with defect bounded by $C\cdot \varphi $ for some (depending on the sequence) constant $C\ge 0$. We will also use notation $\textsf {L}(\Gamma , \mathbf{d}){:}{=}\textsf {Q}\textsf {L}_{0}(\Gamma ,\mathbf{d})$ for the space of linear sequences.

2.3.3 Asymptotic distance

Given two quasi-linear sequences $\bar{\gamma }_1\in \textsf {Q}\textsf {L}_{\varphi _1}(\Gamma , \mathbf{d})$ and $\bar{\gamma }_2 \in \textsf {Q}\textsf {L}_{\varphi _2}(\Gamma ,\mathbf{d})$ the sequence of distances $a(n) {:}{=} \mathbf{d}(\gamma _1(n), \gamma _2(n))$ is $\varphi _{3}$-subadditive, where $\varphi _{3}=\varphi _{1}+\varphi _{2}$ is also admissible, i.e.

$$\begin{aligned} a(m+n)\le a(m) + a(n) + \varphi _{3}(m+n) \end{aligned}$$

for any $n,m\in {\mathbb {N}}$. By the generalization of Fekete’s Lemma by De Bruijn and Erdös [4, Theorem 23], it follows that the following limit exists and finite

$$\begin{aligned} {\hat{\mathbf{d}}}(\bar{\gamma _1},\bar{\gamma _2}) := \lim _{n {\mathop {\rightarrow }\limits ^{}}\infty } \frac{1}{n} \mathbf{d}( \gamma _1(n), \gamma _2(n) ) \end{aligned}$$

We call the quantity ${\hat{\mathbf{d}}}({\bar{\gamma }}_{1},{\bar{\gamma }}_2)$ the asymptotic distance between ${\bar{\gamma }}_{1},{\bar{\gamma }}_{2}\in \textsf {Q}\textsf {L}_{\varphi }(\Gamma ,\mathbf{d})$. It is easy to verify that ${\hat{\mathbf{d}}}$ indeed satisfies all axioms of a pseudo-distance. Even if $\mathbf{d}$ was a proper distance function, the corresponding asymptotic distance may vanish on some pairs of non-identical elements. We call two sequences ${\bar{\gamma }}_{1}\in \textsf {Q}\textsf {L}_{\varphi _{1}}(\Gamma ,\mathbf{d})$, ${\bar{\gamma }}_{2}\in \textsf {Q}\textsf {L}_{\varphi _{2}}(\Gamma ,\mathbf{d})$asymptotically equivalent if ${\hat{\mathbf{d}}}({\bar{\gamma }}_{1},{\bar{\gamma }}_{2})=0$ and write

$$\begin{aligned} {\bar{\gamma }}_{1} {\mathop {=}\limits ^{{\hat{\mathbf{d}}}}} {\bar{\gamma }}_2 \end{aligned}$$

2.3.4 Quasi-homogeneity

We will show that quasi-linear sequences are also quasi-homogeneous in the sense of the following lemma.

Lemma 2.4

Let $\bar{\gamma }\in \Gamma ^{{\mathbb {N}}_{0}}$ be a sequence with $\varphi $-bounded defect. Then for any $m, n \in {\mathbb {N}}$

$$\begin{aligned} \mathbf{d}( \gamma (m \cdot n) , m \cdot \gamma (n) ) \le 8 \cdot m \cdot n \cdot \int _{n}^{2m\cdot n}\frac{\varphi (t)}{t^{2}}{\mathrm{d}}t \end{aligned}$$

Proof

Define the function $\psi : \mathbb {R}_{\ge 0} {\mathop {\rightarrow }\limits ^{}}\mathbb {R}$ related to $\varphi $ as follows

$$\begin{aligned} \psi (s){:}{=}\varphi (\mathbf{e}^{s})/\mathbf{e}^{s} \quad \text {or}\quad \varphi (t)=:t\cdot \psi (\ln t) \end{aligned}$$

The conclusion of the lemma in terms of $\psi $ then reads

$$\begin{aligned} \mathbf{d}( \gamma (m \cdot n) , m \cdot \gamma (n) ) \le 8 \cdot m \cdot n \cdot \int _{\ln n}^{\ln (2\cdot m\cdot n)}\psi (s){\mathrm{d}}s \end{aligned}$$

and it is in that form it will be proven below.

Due to monotonicity properties of $\varphi $, the function $\psi $ satisfies, for all $0\le s_{0}\le s$

$$\begin{aligned} \psi (s_{0}) \le \psi (s)\cdot \mathbf{e}^{s-s_{0}} \end{aligned}$$

which integrated over s yields

$$\begin{aligned} \psi (s_{0}) \le \frac{2}{\ln 2}\int _{s_{0}}^{s_{0}+\ln 2}\psi (s){\mathrm{d}}s \le 4 \int _{s_{0}}^{s_{0}+\ln 2}\psi (s){\mathrm{d}}s \end{aligned}$$

(2.4)

We proceed by induction with respect to m, keeping n fixed. The conclusion of the lemma is obvious for $m=1$. For the induction step let $m=2m'+\epsilon \ge 2$, where $m'=\lfloor m/2\rfloor $ and $\epsilon \in \left\{ 0,1\right\} $. We first use the bound in (2.3) to estimate

$$\begin{aligned}&\mathbf{d}\big (\, \gamma (m \cdot n)\,,\, m \cdot \gamma (n) \,\big ) \\&\quad = \mathbf{d}\big (\, \gamma (m'\cdot n + m'\cdot n + \epsilon \cdot n) \,,\, m'\cdot \gamma (n)+m'\cdot \gamma (n)+\epsilon \cdot \gamma (n) \,\big ) \\&\quad \le 2\mathbf{d}\big (\, \gamma (m' \cdot n) \,,\, m' \cdot \gamma (n) \,\big ) + 2\varphi \big (\,m\cdot n\,\big ) \end{aligned}$$

Next, we continue the estimate using bound (2.4)

$$\begin{aligned}&\mathbf{d}\big (\, \gamma (m \cdot n)\,,\, m \cdot \gamma (n) \,\big ) \\&\quad \le 2\mathbf{d}\big (\, \gamma (m' \cdot n) \,,\, m' \cdot \gamma (n) \,\big ) + 2\varphi \big (\,m\cdot n\,\big ) \\&\quad \le 16 m'\cdot n\cdot \int _{\ln n}^{\ln (2m'\cdot n)}\psi (s){\mathrm{d}}s + 2m\cdot n\cdot \psi \big (\ln (m\cdot n)\big ) \\&\quad \le 8m\cdot n \left( \int _{\ln n}^{\ln (2m'\cdot n)}\psi (s){\mathrm{d}}s + \int _{\ln (m\cdot n)}^{\ln (2m\cdot n)}\psi (s){\mathrm{d}}s \right) \\&\quad \le 8 m\cdot n\cdot \!\!\!\int _{\ln n}^{\ln (2m\cdot n)}\psi (s){\mathrm{d}}s \end{aligned}$$

$\square $

Applying bound (3) in the definition of admissible functions, we obtain the following corollary.

Corollary 2.5

Let $\bar{\gamma }$ be a sequence with $\varphi $-bounded defect. Then for any $m, n \in \mathbb {N}$

$$\begin{aligned} \mathbf{d}( \gamma (m \cdot n) , m \cdot \gamma (n) ) \le 8 \cdot m \cdot n \cdot \int _{n}^{\infty }\frac{\varphi (t)}{t^{2}}{\mathrm{d}}t \le D_\varphi \cdot m\cdot \varphi (n) \end{aligned}$$

2.3.5 The semi-module structure

The group operation $+$ on $\Gamma $ induces a ${\hat{\mathbf{d}}}$-continuous (in fact, 1-Lipschitz) group operation on $\textsf {Q}\textsf {L}_{\varphi }(\Gamma ,\mathbf{d})$ by adding sequences element-wise. Thus $(\textsf {Q}\textsf {L}_{\varphi }(\Gamma ,\mathbf{d}),+,{\hat{\mathbf{d}}})$ is also a metric Abelian monoid. In addition, if $\varphi $ is positive it carries the structure of a $\mathbb {R}_{\ge 0}$-semi-module, as explained below.

If $\varphi >0$ is a positive admissible function, the set $\textsf {Q}\textsf {L}_{\varphi }(\Gamma ,\mathbf{d})$ admits an action of the multiplicative semigroup $(\mathbb {R}_{\ge 0},\,\cdot \,)$ defined in the following way. Let $\lambda \in \mathbb {R}_{\ge 0}$ and ${\bar{\gamma }}=\left\{ \gamma (n)\right\} \in \textsf {Q}\textsf {L}_{\varphi }(\Gamma ,\mathbf{d})$. Then define the action of $\lambda $ on ${\bar{\gamma }}$ by

$$\begin{aligned} {\lambda \cdot {\bar{\gamma }}} {:}{=} \left\{ \gamma \big (\lfloor \lambda \cdot n\rfloor \big )\right\} _{n\in {\mathbb {N}}_{0}} \end{aligned}$$

(2.5)

To show that ${\tilde{\gamma }}{:}{=}\lambda \cdot {\bar{\gamma }}$ belongs to $\textsf {Q}\textsf {L}_\varphi (\Gamma , \mathbf{d})$ we bound its defect as follows. Let $m, n \in \mathbb {N}_0$, and define $\epsilon {:}{=} \lfloor \lambda (m + n)\rfloor - \lfloor \lambda \cdot m \rfloor - \lfloor \lambda \cdot n \rfloor \in \left\{ 0,1\right\} $. In the computation below we assume that $\lambda \ge 1$. For $\lambda \in [0,1]$ the computation is similar, but simpler. We estimate

$$\begin{aligned}&\mathbf{d}\big ( {\tilde{\gamma }}(m + n), {\tilde{\gamma }}(m) + {\tilde{\gamma }}(n) \big ) \\&\quad = \mathbf{d}\Big ( \gamma \big (\lfloor \lambda (m + n) \rfloor \big ), \gamma \big (\lfloor \lambda \cdot m \rfloor \big ) + \gamma \big (\lfloor \lambda \cdot n \rfloor \big ) \Big ) \\&\quad = \mathbf{d}\Big ( \gamma \big (\lfloor \lambda \cdot m \rfloor + \lfloor \lambda \cdot n \rfloor + \epsilon \big ), \gamma \big (\lfloor \lambda \cdot m \rfloor \big ) + \gamma \big (\lfloor \lambda \cdot n \rfloor \big ) \Big ) \\&\quad \le \mathbf{d}\Big ( \gamma \big (\lfloor \lambda \cdot m \rfloor \big ) + \gamma \big (\lfloor \lambda \cdot n \rfloor \big ) + \gamma (\epsilon ) , \gamma \big (\lfloor \lambda \cdot m \rfloor \big ) + \gamma \big (\lfloor \lambda \cdot n \rfloor \big ) \Big ) \\&\qquad + 2\varphi \big (\lfloor \lambda \cdot m \rfloor + \lfloor \lambda \cdot n \rfloor + \epsilon \big ) \\&\quad \le \mathbf{d}(\gamma (\epsilon ),0)+ 2\varphi \big (\lfloor \lambda (m+n) \rfloor \big ) \\&\quad \le \mathbf{d}(\gamma (1),0)+ 2\varphi \big (\lambda (m+n)\big ) \\&\quad \le C\cdot \varphi \big (m+n) \end{aligned}$$

The first inequality above is the bound (2.3) and the last inequality is obtained by applying Lemma 2.3.

The action defined above is only an action up to asymptotic equivalence. Similarly, in the constructions that follow we are tacitly assuming they are valid up to asymptotic equivalence.

The action

$$\begin{aligned} \cdot :\mathbb {R}_{\ge 0}\times \textsf {Q}\textsf {L}_{\varphi }(\Gamma ,\mathbf{d}) {\mathop {\rightarrow }\limits ^{}}\textsf {Q}\textsf {L}_{\varphi }(\Gamma ,\mathbf{d}) \end{aligned}$$

is a homothety (dilation)

$$\begin{aligned} {\hat{\mathbf{d}}}(\lambda \cdot {\bar{\gamma }}_{1}, \lambda \cdot {\bar{\gamma }}_{2}) = \lambda \cdot {\hat{\mathbf{d}}}({\bar{\gamma }}_{1},{\bar{\gamma }}_{2}) \end{aligned}$$

and therefore it is continuous with respect to $\mathbf{d}$.

The semigroup structure on $\textsf {Q}\textsf {L}_{\varphi }(\Gamma ,\mathbf{d})$ is distributive with respect to the $\mathbb {R}_{\ge 0}$-action

$$\begin{aligned} \lambda \cdot ({\bar{\gamma }}_{1}+{\bar{\gamma }}_{2})&= \lambda \cdot {\bar{\gamma }}_{1}+ \lambda \cdot {\bar{\gamma }}_{2} \\ (\lambda _{1}+\lambda _{2})\cdot {\bar{\gamma }}&{\mathop {=}\limits ^{{\hat{\mathbf{d}}}}} \lambda _{1}\cdot {\bar{\gamma }}+ \lambda _{2}\cdot {\bar{\gamma }} \end{aligned}$$

In particular, for $n \in \mathbb {N}$ and ${\bar{\gamma }}\in \textsf {Q}\textsf {L}_{\varphi }(\Gamma ,\mathbf{d})$

$$\begin{aligned} \underbrace{\bar{\gamma }+\cdots +\bar{\gamma }}_{n} {\mathop {=}\limits ^{{\hat{\mathbf{d}}}}} n\cdot {\bar{\gamma }} \end{aligned}$$

2.3.6 Completeness

Here, we introduce additional conditions on a metric Abelian monoid $(\Gamma ,+,\mathbf{d})$, that guarantee that $(\textsf {Q}\textsf {L}_{\varphi }(\Gamma ),{\hat{\mathbf{d}}})$ is a complete metric space.

Suppose $\varphi $ is an admissible function and $(\Gamma ,+,\mathbf{d})$ is a metric Abelian monoid satisfying the following additional property: there exists a constant $C>0$, such that for any quasi-linear sequence ${\bar{\gamma }}\in \textsf {Q}\textsf {L}_\varphi (\Gamma ,\mathbf{d})$, there exists an asymptotically equivalent quasi-linear sequence ${\bar{\gamma }}'$ with defect bounded by $C \varphi $. Note that, contrary to the situation in the definition of $\textsf {Q}\textsf {L}_\varphi (\Gamma , \mathbf{d})$, the constant C is now not allowed to depend on the sequence. If this is the case, we say that $\textsf {Q}\textsf {L}_{\varphi }(\Gamma ,\mathbf{d})$ has the (C-)uniformly bounded defect property.

Proposition 2.6

Suppose a metric Abelian monoid $(\Gamma ,+,\varvec{\delta })$ and an admissible function $\varphi >0$ are such that $(\textsf {Q}\textsf {L}_\varphi (\Gamma ,\varvec{\delta }),{\hat{\varvec{\delta }}})$ has the uniformly bounded defect property and the distance function $\varvec{\delta }$ is homogeneous. Then the space $(\textsf {Q}\textsf {L}_\varphi (\Gamma ,\varvec{\delta }),{\hat{\varvec{\delta }}})$ is complete.

Proof

Given a Cauchy sequence $\left\{ {\bar{\gamma }}_{i}\right\} $ of elements in $(\textsf {Q}\textsf {L}_\varphi (\Gamma ,\varvec{\delta }),{\hat{\varvec{\delta }}})$ we need to find a limit element ${\bar{\eta }}\in \textsf {Q}\textsf {L}_\varphi (\Gamma ,\varvec{\delta })$. We will construct ${\bar{\eta }}$ by a diagonal argument. First we replace each element of the sequence $\left\{ {\bar{\gamma }}_{i}\right\} $ by an asymptotically equivalent element with defect bounded by $C \varphi $ according to the assumption of the proposition. We will still call the new sequence $\left\{ {\bar{\gamma }}_{i}\right\} $. In fact, we may without loss of generality assume that $C=1$.

We begin by establishing a bound on the divergence of the tails of sequences ${\bar{\gamma }}_{i}$ and ${\bar{\gamma }}_{j}$. By homogeneity of $\varvec{\delta }$, the triangle inequality and Corollary 2.5, it holds for any $n,k\in {\mathbb {N}}$ that

$$\begin{aligned} k \cdot \varvec{\delta }\big (\gamma _{i}(n),\gamma _{j}(n)\big )&= \varvec{\delta }\big (k\cdot \gamma _{i}(n),k\cdot \gamma _{j}(n)\big ) \\&\le \varvec{\delta }\big (\gamma _{i}(k\cdot n),\gamma _{j}(k\cdot n)\big ) + 2k \cdot D_{\varphi }\cdot \varphi (n) \end{aligned}$$

Dividing by k and passing to the limit $k{\mathop {\rightarrow }\limits ^{}}\infty $, while keeping n fixed, we obtain

$$\begin{aligned} \varvec{\delta }(\gamma _{i}(n),\gamma _{j}(n)) \le n\cdot {\hat{\varvec{\delta }}}({\bar{\gamma }}_{i},{\bar{\gamma }}_{j}) + 2D_{\varphi }\cdot \varphi (n) \end{aligned}$$

Since the sequence $({\bar{\gamma }}_{i})_{i\in {\mathbb {N}}_{0}}$ is Cauchy, it follows that for any $n \in {\mathbb {N}}$ there is a number $\mathbf{i}(n)\in {\mathbb {N}}$ such that for any $i,j\ge \mathbf{i}(n)$ holds

$$\begin{aligned} {\hat{\varvec{\delta }}}({\bar{\gamma }}_{i},{\bar{\gamma }}_{j})\le \frac{1}{n} \end{aligned}$$

Then for any $i,j,n\in {\mathbb {N}}$ with $i,j\ge \mathbf{i}(n)$ we have the following bound

$$\begin{aligned} \varvec{\delta }\big (\gamma _{i}(n),\gamma _{j}(n)\big ) \le 2D_{\varphi } \cdot \varphi (n) + 1 \end{aligned}$$

(2.6)

Now we are ready to define the limiting sequence ${\bar{\eta }}$ by setting

$$\begin{aligned} \eta (n){:}{=}\gamma _{\mathbf{i}(n)}(n) \end{aligned}$$

First we verify that ${\bar{\eta }}$ is quasi-linear. For $m, n \in {\mathbb {N}}$, we have

$$\begin{aligned} \begin{aligned} \varvec{\delta }\big ( \eta (n+m), \eta (n)+\eta (m) \big )&= \varvec{\delta }\big (\, \gamma _{\mathbf {i}(n+m)}(n+m),\, \gamma _{\mathbf {i}(n)}(n)+\gamma _{\mathbf {i}(m)}(m) \,\big ) \\&\le \varvec{\delta }\big (\, \gamma _{\mathbf {i}(n+m)}(n+m),\, \gamma _{\mathbf {i}(n+m)}(n) + \gamma _{\mathbf {i}(n+m)}(m)\, \big ) \\&\quad + \varvec{\delta }\big (\, \gamma _{\mathbf {i}(n+m)}(n)+\gamma _{\mathbf {i}(n+m)}(m),\, \gamma _{\mathbf {i}(n)}(n)+\gamma _{\mathbf {i}(m)}(m)\, \big ) \\&\le \varphi (n+m) + 2D_{\varphi }\cdot \varphi (n) + 1 + 2D_{\varphi }\cdot \varphi (m) + 1 \\&\le (4D_{\varphi }+1)\varphi (n+m) + 2 \le C' \cdot \varphi (n + m) \end{aligned} \end{aligned}$$

for some constant $C'> 0$.

The convergence of ${\bar{\gamma }}_{i}$ to ${\bar{\eta }}$ is shown as follows. For $n,k\in {\mathbb {N}}$ let $q_n,r_n\in {\mathbb {N}}_{0}$ be the quotient and the remainder of the division of n by k, that is $n=q_n\cdot k+r_n$ and $0\le r_n < k$. Fix $k\in {\mathbb {N}}$ and let $i\ge \mathbf{i}(k)$, then

$$\begin{aligned} {\hat{\varvec{\delta }}}({\bar{\gamma }}_{i},{\bar{\eta }})&= \lim _{n{\mathop {\rightarrow }\limits ^{}}\infty }\frac{1}{n} \varvec{\delta }\big (\gamma _{i}(n),\eta (n)\big ) \\&= \lim _{n{\mathop {\rightarrow }\limits ^{}}\infty } \frac{1}{n} \varvec{\delta }\big (\, \gamma _{i}(q_n\cdot k+r_n), \gamma _{\mathbf{i}(n)}(q_n\cdot k+r_n)\, \big ) \\&\le \limsup _{n{\mathop {\rightarrow }\limits ^{}}\infty } \frac{1}{n} \Big (q_n\cdot \varvec{\delta }\big (\gamma _{i}(k),\gamma _{\mathbf{i}(n)}(k)\big )+ \varvec{\delta }\big (\gamma _{i}(r_n),\gamma _{\mathbf{i}(n)}(r_n)\big )\\&\quad \;+ 4q_n \cdot D_{\varphi } \cdot \varphi (k)+2 \varphi (n) \Big ) \\&\le \limsup _{n{\mathop {\rightarrow }\limits ^{}}\infty } \frac{1}{n} \Big (q_n \cdot (2D_\varphi \cdot \varphi (k) + 1) + (2D_\varphi \cdot \varphi (r_n) + 1) \;+ \\&\quad + 4q_n \cdot D_\varphi \cdot \varphi (k)+2 \varphi (n) \Big ) \\&=C'' \cdot \varphi (k)/k \end{aligned}$$

Since $k\in {\mathbb {N}}$ is arbitrary and $\varphi $ is sub-linear we have

$$\begin{aligned} \lim _{i{\mathop {\rightarrow }\limits ^{}}\infty }{\hat{\varvec{\delta }}}({\bar{\gamma }}_{i},{\bar{\eta }})=0 \end{aligned}$$

$\square $

2.3.7 On the density of linear sequences

For a metric Abelian monoid $(\Gamma , +, \mathbf{d})$ together with an admissible function $\varphi $ we say that $\textsf {Q}\textsf {L}_{\varphi }(\Gamma ,\mathbf{d})$ has the vanishing defect property if for every $\epsilon > 0$ and for every ${\bar{\gamma }} \in \textsf {Q}\textsf {L}_\varphi (\Gamma , \mathbf{d})$ there exists an asymptotically equivalent quasi-linear sequence ${\bar{\gamma }}'$ with defect bounded by another admissible function $\psi $ such that $\int _{1}^{\infty }\frac{\psi (t)}{t^{2}}{\mathrm{d}}t<\epsilon $.

The proposition below gives a sufficient condition under which the linear sequences are dense in the space of quasi-linear sequences.

Proposition 2.7

Suppose $\textsf {Q}\textsf {L}_{\varphi }(\Gamma , + , \mathbf{d})$ has the vanishing defect property. Then $\textsf {L}(\Gamma ,\mathbf{d})$ is dense in $\textsf {Q}\textsf {L}_\varphi (\Gamma ,\mathbf{d})$.

Proof

Let ${\bar{\gamma }}=\left\{ \gamma (n)\right\} $ be a quasi-linear sequence. For any $i\in {\mathbb {N}}$ select a sequence ${\bar{\gamma }}_{i}$ asymptotically equivalent to ${\bar{\gamma }}$ with defect bounded by an admissible function $\varphi _{i}$ such that $\int _{1}^{\infty }\frac{\varphi _{i}(t)}{t^{2}}{\mathrm{d}}t<1/i$ according to the “vanishing defect” assumption of the lemma.

Define $\bar{\eta }_i$ by

$$\begin{aligned} \eta _i(n) {:}{=} n \cdot \gamma _i(1) \end{aligned}$$

Then

$$\begin{aligned} {\hat{\mathbf{d}}}({\bar{\gamma }},{\bar{\eta }}_{i})&= {\hat{\mathbf{d}}}({\bar{\gamma }}_{i},{\bar{\eta }}_{i}) = \lim _{n{\mathop {\rightarrow }\limits ^{}}\infty } \frac{1}{n} \mathbf{d}(\gamma _{i}(n),\eta _{i}(n)) = \lim _{n{\mathop {\rightarrow }\limits ^{}}\infty } \frac{1}{n} \mathbf{d}\big (\gamma _{i}(n),n\cdot \gamma _{i}(1)\big ) \\&\le 8\int _{1}^{\infty }\frac{\varphi _{i}(t)}{t^{2}}{\mathrm{d}}t \le \frac{8}{i} \end{aligned}$$

where we used Lemma 2.4 in the first inequality. Thus, any quasi-linear sequence can be approximated by linear sequences. $\square $

2.3.8 Asymptotic distance on original monoid

Starting with an element $\gamma \in \Gamma $ one can construct a linear sequence $\mathbf {\gamma }=\left\{ i\cdot \gamma \right\} _{i\in {\mathbb {N}}_{0}}$. In view of Proposition 2.1, the map

$$\begin{aligned} {\mathbf {\cdot }}:\big (\Gamma , \mathbf{d}\big ) {\mathop {\rightarrow }\limits ^{}}\big (\textsf {L}(\Gamma ,\mathbf{d}), {\hat{\mathbf{d}}}\big ) \end{aligned}$$

(2.7)

is a contraction.

The inclusion in (2.7) induces a metric $\varvec{\delta }$ on $\Gamma $, satisfying for any $\gamma _{1},\gamma _{2}\in \Gamma $

$$\begin{aligned} \varvec{\delta }(\gamma _{1},\gamma _{2})\le \mathbf{d}(\gamma _{1},\gamma _{2}) \end{aligned}$$

(2.8)

and the following homogeneity condition

$$\begin{aligned} \varvec{\delta }(n\cdot \gamma _{1},n\cdot \gamma _{2}) = n\cdot \varvec{\delta }(\gamma _{1},\gamma _{2}) \end{aligned}$$

(2.9)

for all $n \in {\mathbb {N}}_0$.

Note that if $\mathbf{d}$ was homogeneous to begin with, then $\varvec{\delta }$ coincides with $\mathbf{d}$ on $\Gamma $.

By virtue of the bound $\varvec{\delta }\le \mathbf{d}$, sequences that are quasi-linear with respect to $\mathbf{d}$ are also quasi-linear with respect to $\varvec{\delta }$. Since $\varvec{\delta }$ is scale-invariant, the associated asymptotic distance ${\hat{\varvec{\delta }}}$ coincides with $\varvec{\delta }$ on $\Gamma $. We will show (in Lemma 2.8 below) that ${\hat{\varvec{\delta }}}$ also coincides with ${\hat{\mathbf{d}}}$ on $\mathbf{d}$-quasi-linear sequences.

Let $\varphi $ be an admissible function. In order to organize all these statements, and to be more precise, let us include the spaces in the following commutative diagram.

(2.10)

The maps $f, f'$ and are isometries. The maps and are isometric embeddings. The next lemmas show that is also an isometric embedding, and it has dense image.

Lemma 2.8

Let $\varphi $ be a positive, admissible function. Then, the natural inclusion

is an isometric embedding with the dense image.

Proof

First we show that the map is an isometric embedding. Let ${\bar{\gamma }}_{1},{\bar{\gamma }}_{2}\in \textsf {Q}\textsf {L}_{\varphi }(\Gamma ,\mathbf{d})$ be two $\varphi $-quasi-linear sequences with respect to the distance function $\mathbf{d}$. We have to show that the two numbers

$$\begin{aligned} {\hat{\mathbf{d}}}({\bar{\gamma }}_{1},{\bar{\gamma }}_{2}) = \lim _{n{\mathop {\rightarrow }\limits ^{}}\infty }\frac{1}{n} \mathbf{d}\big (\gamma _{1}(n),\gamma _{2}(n)\big ) \end{aligned}$$

and

$$\begin{aligned} {\hat{\varvec{\delta }}}({\bar{\gamma }}_{1},{\bar{\gamma }}_{2}) = \lim _{n{\mathop {\rightarrow }\limits ^{}}\infty }\frac{1}{n} \varvec{\delta }\big (\gamma _{1}(n),\gamma _{2}(n)\big ) \end{aligned}$$

are equal. Since shifts are non-expanding maps, we have $\varvec{\delta }\le \mathbf{d}$ and it follows immediately that

$$\begin{aligned} {\hat{\varvec{\delta }}}({\bar{\gamma }}_{1},{\bar{\gamma }}_{2}) \le {\hat{\mathbf{d}}}({\bar{\gamma }}_{1},{\bar{\gamma }}_{2}) \end{aligned}$$

and we are left to show the opposite inequality. We will do it as follows. Fix $n>0$, then

$$\begin{aligned} {\hat{\mathbf{d}}}({\bar{\gamma }}_{1},{\bar{\gamma }}_{2})&= \lim _{k{\mathop {\rightarrow }\limits ^{}}\infty } \frac{1}{k\cdot n} \mathbf{d}\big (\gamma _1(k\cdot n),\gamma _{2}(k\cdot n)\big ) \\&\le \lim _{k{\mathop {\rightarrow }\limits ^{}}\infty }\frac{1}{k\cdot n} \bigg ( \mathbf{d}\big (k\cdot \gamma _1(n),k\cdot \gamma _{2}(n)\big ) + 2 k\cdot D_\varphi \cdot \varphi (n) \bigg ) \\&\le \frac{1}{n}\varvec{\delta }\big (\gamma _{1}(n),\gamma _{2}(n)\big ) + 2D_{\varphi }\frac{\varphi (n)}{n} \end{aligned}$$

Passing to the limit with respect to n gives the required inequality

$$\begin{aligned} {\hat{\mathbf{d}}}({\bar{\gamma }}_{1},{\bar{\gamma }}_{2}) \le {\hat{\varvec{\delta }}}({\bar{\gamma }}_{1},{\bar{\gamma }}_{2}) \end{aligned}$$

Now we will show that the image of is dense. Given an element ${\bar{\gamma }}=\left\{ \gamma (n)\right\} $ in $\textsf {Q}\textsf {L}_{\varphi }(\Gamma ,{\hat{\mathbf{d}}})$ we have to find a ${\hat{\varvec{\delta }}}$-approximating sequence ${\bar{\gamma }}_{i}=\left\{ \gamma _{i}(n)\right\} $ in $\textsf {Q}\textsf {L}_{\varphi }(\Gamma ,\mathbf{d})$. Define

$$\begin{aligned} \gamma _{i}(n) {:}{=} \left\lfloor \frac{n}{i}\right\rfloor \cdot \gamma (i) \end{aligned}$$

We have to show that each ${\bar{\gamma }}_{i}$ is $\mathbf{d}$-quasi-linear and that ${\hat{\varvec{\delta }}}({\bar{\gamma }}_{i},{\bar{\gamma }}){\mathop {\longrightarrow }\limits ^{i{\mathop {\rightarrow }\limits ^{}}\infty }}0$. These statements follow from

$$\begin{aligned} \mathbf{d}\big (\gamma _i(m+n), \gamma _i(m) + \gamma _i(n) \big )&= \mathbf{d}\left( \left\lfloor \frac{m+n}{i}\right\rfloor \cdot \gamma (i), \left\lfloor \frac{m}{i}\right\rfloor \cdot \gamma (i) + \left\lfloor \frac{n}{i} \right\rfloor \cdot \gamma (i) \right) \\&\le \mathbf{d}\left( \gamma (i), {\mathbf {0}} \right) \\&\le C_i \cdot \varphi (m+n) \end{aligned}$$

for some $C_i > 0$. It is worth noting that the defect of ${\bar{\gamma }}_{i}$ may not be bounded uniformly with respect to i. Finally, it holds that

$$\begin{aligned} {\hat{\varvec{\delta }}}({\bar{\gamma }}_i,{\bar{\gamma }})&= \lim _{n {\mathop {\rightarrow }\limits ^{}}\infty } \frac{1}{n}\varvec{\delta }\left( \gamma _i(n), \gamma (n) \right) = \lim _{n {\mathop {\rightarrow }\limits ^{}}\infty } \frac{1}{n} \varvec{\delta }\left( \left\lfloor \frac{n}{i}\right\rfloor \cdot \gamma (i), \gamma (n) \right) \\&\le \lim _{n {\mathop {\rightarrow }\limits ^{}}\infty } \left[ \frac{1}{n} \varvec{\delta }\left( \gamma \left( i\lfloor \tfrac{n}{i}\rfloor \right) , \gamma (n) \right) + \frac{1}{n}\left\lfloor \frac{n}{i} \right\rfloor \cdot D_\varphi \cdot \varphi (i) \right] \\&\le \lim _{n {\mathop {\rightarrow }\limits ^{}}\infty } \left[ \frac{1}{n} \max _{k=0,\ldots ,i-1} \varvec{\delta }\left( \gamma (k), {\varvec{0}} \right) + \frac{1}{n}\varphi (n) \right] + D_{\varphi }\frac{\varphi (i)}{i} = D_{\varphi }\frac{\varphi (i)}{i}{\mathop {{\mathop {\longrightarrow }\limits ^{}}}\limits ^{i{\mathop {\rightarrow }\limits ^{}}\infty }}0 \end{aligned}$$

$\square $

The difference between the distance functions ${\hat{\mathbf{d}}}$, $\varvec{\delta }$ and ${\hat{\varvec{\delta }}}$ is very small: ${\hat{\mathbf{d}}}$ and $\varvec{\delta }$ are defined on the dense subset of the domain of definition of ${\hat{\varvec{\delta }}}$ and they coincide whenever are both defined. From now on we will write $\mathbf{d}$ for the original distance function and $\varvec{\delta }$ for the asymptotic metric on both the monoid and its tropicalization.

3 Grothendieck construction

Given an Abelian monoid with the cancellation property, there is a minimal Abelian group (called the Grothendieck Group of the monoid), into which it isomorphically embeds. Similarly, an $\mathbb {R}_{\ge 0}$-semi-module naturally embeds into a normed vector space. A nice example of this construction applied to the semi-module of convex sets in $\mathbb {R}^n$ (with the Minkowski sum and the Hausdorff distance) can be found in [12].

Proposition 3.1

Let $(\Gamma ,+,\cdot ,\varvec{\delta })$ be a complete metric Abelian monoid with $\mathbb {R}_{\ge 0}$ action (an $\mathbb {R}_{\ge 0}$-semi-module) with homogeneous pseudo-metric $\varvec{\delta }$. Then there exists a Banach space $(\mathbf{B},||\,\cdot \,||)$ and a distance-preserving homomorphism of monoids

$$\begin{aligned} f: \Gamma {\mathop {\rightarrow }\limits ^{}}\mathbf{B}\end{aligned}$$

such that the image of f is a closed convex cone.

If $\varvec{\delta }$ is a proper pseudo-metric (not a metric), then the map f is not injective.

Proof

By Lemma 2.2 the pseudo-metric $\varvec{\delta }$ is translation invariant. We can therefore apply the Grothendieck construction to define a normed vector space $\mathbf{B}_0$: Define

$$\begin{aligned} \mathbf{B}_0 {:}{=} \left\{ (x,y)\;\mathbf{: }\;x,y\in \Gamma \right\} /\sim \end{aligned}$$

where $(x,y)\sim (x',y')$ if there are $z,z'\in \Gamma $, such that $x+z{\mathop {=}\limits ^{\varvec{\delta }}}x'+z'$ and $y+z{\mathop {=}\limits ^{\varvec{\delta }}}y'+z'$.

Define also addition, multiplication by a scalar and a norm on $\mathbf{B}_0$ by setting for all $x,y,x',y'\in \Gamma $ and $\lambda \in \mathbb {R}$

$$\begin{aligned} (x,y)+(x',y')&:=\, (x+x',y+y') \\ (-1)\cdot (x,y)&:=\, (y,x) \\ \lambda \cdot (x,y)&:=\, \mathrm {sign}(\lambda )\cdot (|\lambda |\cdot x,|\lambda |\cdot y) \\ ||(x,y)||&:=\, \varvec{\delta }(x,y) \end{aligned}$$

These operations respect the equivalence relation and turn $(\mathbf{B}_0,+,\cdot ,||\,\cdot \,||)$ into a normed vector-space. The map f defined by

$$\begin{aligned} f:\Gamma {\mathop {\rightarrow }\limits ^{}}\mathbf{B}_0, \quad x\mapsto (x,{\mathbf {0}}) \end{aligned}$$

is a well-defined distance-preserving homomorphism.

That $f(\Gamma )$ is closed immediately follows as $\Gamma $ is complete and f is distance-preserving.

In general, the space $\mathbf{B}_0$ is not complete. We define the Banach space $\mathbf{B}$ as the completion of the normed vector space $\mathbf{B}_0$. $\square $

4 Tropical probability spaces and their diagrams

4.1 Diagrams of probability spaces

We will now briefly describe the construction of diagrams of probability spaces, see [9] for a more detailed discussion. By a finite probability space we will mean a set (not necessarily finite) with a probability measure, such that the support of the measure is finite. For such probability space X we denote by |X| the cardinality of the support of probability measure and the expression $x\in X$ will mean, that x is an atom in X, which is a point of positive weight in the underlying set.

We will consider commutative diagrams of finite probability spaces, where arrows are equivalence classes of measure-preserving maps. Two maps are considered equivalent if they coincide on a set of full measure and such equivalence classes will be called reductions.

Three examples of diagrams of probability spaces are pictured in (1.1). The combinatorial structure of such a commutative diagram can be recorded by an object $\mathbf{G}$, which could be equivalently considered as a special type of category, a finite poset, or a directed acyclic graph (DAG) with additional properties. We will call such objects simply indexing categories. Below we briefly recall the definition.

An indexing category is a finite category such that for any pair of objects there exists at most one morphism between them in either direction, and such that it satisfies the following property. For any pair of objects i, j in an indexing category $\mathbf{G}$ there exists a least common ancestor, i.e. an object k such that there are morphisms $k{\mathop {\rightarrow }\limits ^{}}i$ and $k{\mathop {\rightarrow }\limits ^{}}j$ in $\mathbf{G}$ and such that for any other object l admitting morphisms $l{\mathop {\rightarrow }\limits ^{}}i$ and $l{\mathop {\rightarrow }\limits ^{}}j$, there is also a morphism $l{\mathop {\rightarrow }\limits ^{}}k$.

By $\llbracket \mathbf{G} \rrbracket $ we denote the number of objects in the indexing category, or equivalently the number of vertices in the DAG or the number of points in the poset $\mathbf{G}$. An important class of examples of indexing categories is formed by so-called full categories$\varvec{\Lambda }_{n}$, that correspond to the poset of non-empty subsets of a set $\left\{ 1,\ldots ,n\right\} $ ordered by coinclusion. If $n=2$, we call the category

$$\begin{aligned} \varvec{\Lambda }_{2}=(O_{1}{\mathop {\leftarrow }\limits ^{}}O_{\left\{ 1,2\right\} }{\mathop {\rightarrow }\limits ^{}}O_{2}) \end{aligned}$$

a fan. We refer to the objects $O_{1}$ and $O_{2}$ as the feet of the fan and to $O_{12}$ as the initial object. We use the same terminology for the spaces in a diagram indexed by $\varvec{\Lambda }_{2}$.

The space of all commutative diagrams of a fixed combinatorial type will be denoted $\mathbf {Prob}\langle \mathbf{G}\rangle $. A morphism between two diagrams $\mathcal {X},\mathcal {Y}\in \mathbf {Prob}\langle \mathbf{G}\rangle $ is, by definition, a natural transformation between functors $\mathcal {X}$ and $\mathcal {Y}$. Essentially, it is a collection of morphisms between corresponding individual spaces in $\mathcal {X}$ and $\mathcal {Y}$, that commute with morphisms within the diagrams $\mathcal {X}$ and $\mathcal {Y}$. We call such morphisms reductions of diagrams.

The construction of forming commutative diagrams could be iterated, producing diagrams of diagrams. Especially important will be two-fans of $\mathbf{G}$-diagrams, the space of which will be denoted $\mathbf {Prob}\left\langle \mathbf{G}\right\rangle \left\langle \varvec{\Lambda }_{2}\right\rangle $.

A two-fan $\mathcal {X}$ will be called minimal, if for any morphism of $\mathcal {X}$ to another two-fan $\mathcal {Y}$, the following holds: if the induced morphisms on the feet are isomorphisms, then the top morphism is also an isomorphism. Any $\mathbf{G}$-diagram will be called minimal if for any sub-diagram, which is a two-fan, it contains a minimal two-fan with the same feet.

Given an n-tuple $(\textsf {X}_{1},\ldots ,\textsf {X}_{n})$ of finite-valued random variables, one can construct a minimal $\varvec{\Lambda }_{n}$-diagram $\mathcal {X}=\left\{ X_{I};\chi _{IJ}\right\} $ by setting for any $\emptyset \ne I\subset \left\{ 1,\ldots ,n\right\} $

$$\begin{aligned} X_{I}=\prod _{i\in I} X_{i} \end{aligned}$$

where $X_{i}$ is the target space of random variable $\textsf {X}_{i}$, and the probabilities are the induced distributions. For the diagram constructed in such a way we will write $\mathcal {X}=\left\langle \textsf {X}_{1},\ldots ,\textsf {X}_{n}\right\rangle $. On the other hand, any $\varvec{\Lambda }_{n}$-diagram gives rise to the n-tuple of random variables with the domain of definition being the initial space and the targets being the spaces indexed by one-point sets.

The constant diagram$X^{\mathbf{G}}$ is $\mathbf{G}$-diagram in which all the spaces are isomorphic to a single probability space X and all the morphisms are identity maps. In particular, we denote by $\left\{ \bullet \right\} ^{\mathbf{G}}$ the $\mathbf{G}$-diagram consisting entirely of one-point spaces.

The tensor product $\mathcal {X}\otimes \mathcal {Y}$ of two $\mathbf{G}$-diagrams is defined by taking the tensor product of corresponding probability spaces and the Cartesian product of maps. The diagram $\left\{ \bullet \right\} ^{\mathbf{G}}$ is a unit with respect to the tensor product. Certain care should be exercised here, since the assocaitivity, commutativity and unity of $\left\{ \bullet \right\} ^{\mathbf{G}}$ for the tensor product only hold up to isomorphism.

For a diagram $\mathcal {X}\in \mathbf {Prob}\langle \mathbf{G}\rangle $ one can evaluate entropies of the individual spaces. The corresponding map will be denoted

$$\begin{aligned} \textsf {Ent} _{*}:\mathbf {Prob}\langle \mathbf{G}\rangle {\mathop {\rightarrow }\limits ^{}}\mathbb {R}^{\llbracket \mathbf{G} \rrbracket }\end{aligned}$$

where the target space is the space of $\mathbb {R}$-valued functions on objects in $\mathbf{G}$ and it is equipped with the $\ell ^{1}$-norm.

For a two-fan $\mathcal {F}=(\mathcal {X}{\mathop {\leftarrow }\limits ^{}}\mathcal {Z}{\mathop {\rightarrow }\limits ^{}}\mathcal {Y})$ of $\mathbf{G}$-diagrams define the entropy distance

$$\begin{aligned} \mathrm {kd}(\mathcal {F}) {:}{=} \Vert \textsf {Ent} _{*}\mathcal {Z}-\textsf {Ent} _{*}\mathcal {X}\Vert _{1} + \Vert \textsf {Ent} _{*}\mathcal {Z}-\textsf {Ent} _{*}\mathcal {Y}\Vert _{1} \end{aligned}$$

We interpret $\mathrm {kd}(\mathcal {F})$ as a measure of deviation of $\mathcal {F}$ from being an isomorphism between the diagrams $\mathcal {X}$ and $\mathcal {Y}$. Indeed, $\mathrm {kd}(\mathcal {F})=0$ if and only if the two morphisms in $\mathcal {F}$ are isomorphisms, see [9].

We define the intrinsic entropy distance$\mathbf {k}$ on the space $\mathbf {Prob}\langle \mathbf{G}\rangle $ by

$$\begin{aligned} \mathbf {k}(\mathcal {X},\mathcal {Y}) {:}{=} \inf \left\{ \mathrm {kd}(\mathcal {F})\;\mathbf{: }\;\mathcal {F}= (\mathcal {X}{\mathop {\leftarrow }\limits ^{}}\mathcal {Z}{\mathop {\rightarrow }\limits ^{}}\mathcal {Y})\in \mathbf {Prob}\langle \mathbf{G}\rangle \langle \varvec{\Lambda }_{2} \rangle \right\} \end{aligned}$$

Note that according to the definitions used in this article any indexing category must have an initial object. In [9] such indexing categories and diagrams indexed by such categories were called complete. Therefore purely by a change of names, results that in [9] were said to hold for complete indexing categories, hold for the indexing categories of this article.

The tensor product is 1-Lipschitz with respect to $\mathbf {k}$, thus $(\mathbf {Prob}\langle \mathbf{G}\rangle ,\otimes ,\mathbf {k})$ is a metric Abelian monoid and $\textsf {Ent} _{*}:(\mathbf {Prob}\langle \mathbf{G}\rangle ,\otimes ,\mathbf {k}){\mathop {\rightarrow }\limits ^{}}(\mathbb {R}^{\llbracket \mathbf{G} \rrbracket },\Vert \cdot \Vert _{1})$ is a 1-Lipschitz homomorphism. For proofs and more detailed discussion the reader is referred to [9].

4.2 Tropical diagrams

In this section we apply the general construction in Sects. 2 and 3 to the metric Abelian monoid $(\mathbf {Prob}\left\langle \mathbf{G}\right\rangle ,\otimes ,\,\cdot \,,\varvec{\kappa })$.

We define the asymptotic distance on $\mathbf {Prob}\left\langle \mathbf{G}\right\rangle $ by

$$\begin{aligned} \varvec{\kappa }(\mathcal {X},\mathcal {Y}){:}{=}\lim _{n{\mathop {\rightarrow }\limits ^{}}\infty }\frac{1}{n}\mathbf {k}(\mathcal {X}^{n},\mathcal {Y}^{n}) \end{aligned}$$

One of the main tools for the estimation of the (asymptotic) distance is the so-called Slicing Lemma, [9, Proposition 3.9]. We will only need its corollary that we formulate below in Proposition 4.1. For a diagram $\mathcal {X}$, a space U in it and an atom $u\in U$, we may form a conditioned diagram $\mathcal {X}|u$ by conditioning all the spaces in $\mathcal {X}$ on u.

Proposition 4.1

Let $\mathbf{G}$ be an indexing category, $\mathcal {X},\mathcal {Y}\in \mathbf {Prob}\left\langle \mathbf{G}\right\rangle $ and $U^{\mathbf{G}}\in \mathbf {Prob}\left\langle \mathbf{G}\right\rangle $.

(1)
Let $\mathcal {X}{\mathop {\rightarrow }\limits ^{}}U^{\mathbf{G}}$ be a reduction, then
$$\begin{aligned} \mathbf {k}(\mathcal {X},\mathcal {Y})&\le \int _{U}\mathbf {k}(\mathcal {X}|u,\mathcal {Y}){\mathrm{d}}p_{U}(u)+ \llbracket \mathbf{G} \rrbracket \cdot \textsf {Ent} (U) \end{aligned}$$
(2)
For a “co-fan” $\mathcal {X}{\mathop {\rightarrow }\limits ^{}}U^\mathbf{G}{\mathop {\leftarrow }\limits ^{}}\mathcal {Y}$ holds
$$\begin{aligned} \mathbf {k}(\mathcal {X},\mathcal {Y}) \le \int _{U}\mathbf {k}(\mathcal {X}|u,\mathcal {Y}|u){\mathrm{d}}p_{U}(u) \end{aligned}$$

The statements and the proofs of the Slicing Lemma and its consequences can be found in [9].

We will show below that $(\mathbf {Prob}\langle \mathbf{G}\rangle ,\otimes ,\varvec{\kappa })$ has the uniformly bounded and vanishing defect properties. For this purpose we need to develop some technical tools.

4.3 Mixtures

The input data for the mixture operation is a family of $\mathbf{G}$-diagrams, parameterized by a probability space. As a result one obtains another $\mathbf{G}$-diagram with pre-specified conditionals. One particular instance of a mixture is when one mixes two diagrams $\mathcal {X}$ and $\left\{ \bullet \right\} ^{\mathbf{G}}$, the latter being a constant $\mathbf{G}$-diagram of one-point probability spaces. This operation will be used as a substitute for taking radicals “$\mathcal {X}^{\frac{1}{n}}$” below.

4.3.1 Definition of mixtures

Let $\mathbf{G}$ be an indexing category and $\Theta $ be a probability space. By $\Theta ^{\mathbf{G}}$ we denote the constant$\mathbf{G}$-diagram—the diagram such that all spaces in it are $\Theta $ and all morphisms are identity morphisms. Let be a family of $\mathbf{G}$-diagrams parameterized by $\Theta $. The mixture of the family $\left\{ \mathcal {X}_{\theta }\right\} $ is the reduction

such that

$$\begin{aligned} \mathcal {Y}|\theta \cong \mathcal {X}_{\theta } \quad \hbox { for any}\ \theta \in \Theta \end{aligned}$$

(4.1)

The mixture exists and is uniquely defined by property (4.1) up to an isomorphism which is identity on $\Theta ^{\mathbf{G}}$.

We denote the top diagram $\mathcal {Y}$ of the mixture by

$$\begin{aligned} \mathcal {Y}=:\bigoplus _{\theta \in \Theta }\mathcal {X}_{\theta } \end{aligned}$$

and also call it the mixture of the family $\left\{ \mathcal {X}_{\theta }\right\} $.

When

$$\begin{aligned} \Theta =\mathbb {B}_{\alpha } {:=}\, \big (\left\{ \square ,\blacksquare \right\} ; p(\blacksquare )=\alpha \big ) \end{aligned}$$

is a binary space we write simply

$$\begin{aligned} \mathcal {X}_{\blacksquare }\oplus _{\mathbb {B}_{\alpha }}\mathcal {X}_{\square } \end{aligned}$$

for the mixture. The diagram subindexed by the $\blacksquare $ will always be the first summand.

The entropy of the mixture can be evaluated by the following formula

$$\begin{aligned} \textsf {Ent} _{*}\left( \bigoplus _{\theta \in \Theta }\mathcal {X}_{\theta }\right) = \int _{\Theta }\textsf {Ent} _{*}(\mathcal {X}_{\theta }){\mathrm{d}}p(\theta ) + \textsf {Ent} _{*}(\Theta ^{\mathbf{G}}) \end{aligned}$$

Mixtures satisfy the distributive law with respect to the tensor product

4.3.2 The distance estimates for the mixtures

The mixture of a $\mathbf{G}$-diagram with the constant diagram of one-point spaces $\left\{ \bullet \right\} ^{\mathbf{G}}$ may serve as an substitute of taking radicals of the diagram. The following lemma provides a justification of this by some distance estimates related to mixtures and will be used below.

Lemma 4.2

Let $\mathbf{G}$ be a complete indexing category and $\mathcal {X},\mathcal {Y}\in \mathbf {Prob}\left\langle \mathbf{G}\right\rangle $. Then

1.
$\displaystyle \varvec{\kappa }(\mathcal {X},\mathcal {X}^{n}\oplus _{\mathbb {B}_{1/n}}\left\{ \bullet \right\} ) \le \textsf {Ent} (\mathbb {B}_{1/n}) $
2.
$\displaystyle \varvec{\kappa }\big (\mathcal {X},(\mathcal {X}\oplus _{\mathbb {B}_{1/n}}\left\{ \bullet \right\} )^{n}\big ) \le n\cdot \textsf {Ent} (\mathbb {B}_{1/n}) $
3.
$\displaystyle \varvec{\kappa }\big ( (\mathcal {X}\otimes \mathcal {Y})\oplus _{\mathbb {B}_{1/n}}\left\{ \bullet \right\} , (\mathcal {X}\oplus _{\mathbb {B}_{1/n}}\left\{ \bullet \right\} ) \otimes (\mathcal {Y}\oplus _{\mathbb {B}_{1/n}}\left\{ \bullet \right\} ) \big ) \le 3\textsf {Ent} (\mathbb {B}_{1/n}) $
4.
$\displaystyle \varvec{\kappa }\big ((\mathcal {X}\oplus _{\mathbb {B}_{1/n}}\left\{ \bullet \right\} ), (\mathcal {Y}\oplus _{\mathbb {B}_{1/n}}\left\{ \bullet \right\} )\big ) \le \frac{1}{n}\varvec{\kappa }(\mathcal {X},\mathcal {Y}) $

Note that the distance estimates in the lemma above are with respect to the asymptotic distance. This is essential, since from the perspective of the intrinsic distance mixtures are very badly behaved.

Proof

For $\lambda \in \mathbb {B}_{1/n}^N$, define $\mathbf{q}(\lambda )$ to be the number of black squares in the sequence $\lambda $. It is a binomially distributed random variable with mean N/n and variance $\frac{N}{n}(1-\frac{1}{n})$.

The first claim is then proven by the following calculation

$$\begin{aligned}&\varvec{\kappa }(\mathcal {X},\mathcal {X}^{n}\oplus _{\mathbb {B}_{1/n}}\left\{ \bullet \right\} ) \\&\quad = \lim _{N{\mathop {\rightarrow }\limits ^{}}\infty } \frac{1}{N} \mathbf {k}\left( \mathcal {X}^{N}, (\mathcal {X}^{n}\oplus _{\mathbb {B}_{1/n}}\left\{ \bullet \right\} )^{N} \right) \\&\quad = \lim _{N{\mathop {\rightarrow }\limits ^{}}\infty } \frac{1}{N} \mathbf {k}\left( \mathcal {X}^{N}, \bigoplus _{\lambda \in \mathbb {B}_{1/n}^{N}} \mathcal {X}^{n\cdot \mathbf{q}(\lambda )} \right) \\&\quad \le \textsf {Ent} (\mathbb {B}_{1/n}) + \lim _{N{\mathop {\rightarrow }\limits ^{}}\infty } \frac{1}{N} \int _{\lambda \in \mathbb {B}^{n}_{1/n}} \mathbf {k}(\mathcal {X}^{N}, \mathcal {X}^{n\cdot \mathbf{q}(\lambda )}) {\mathrm{d}}p(\lambda ) \\&\quad \le \textsf {Ent} (\mathbb {B}_{1/n}) + \Vert \textsf {Ent} _{*}(\mathcal {X})\Vert _{1}\cdot \lim _{N{\mathop {\rightarrow }\limits ^{}}\infty } \frac{n}{N} \cdot \int _{\lambda \in \mathbb {B}_{1/n}^{N}} \big |N/n- \mathbf{q}(\lambda )\big | {\mathrm{d}}p(\lambda ) \\&\quad \le \textsf {Ent} (\mathbb {B}_{1/n}) + \Vert \textsf {Ent} _{*}(\mathcal {X})\Vert _{1}\cdot \lim _{N{\mathop {\rightarrow }\limits ^{}}\infty } \frac{n}{N}\cdot \sqrt{N\cdot \frac{1}{n}(1-\frac{1}{n})} = \textsf {Ent} (\mathbb {B}_{1/n}) \end{aligned}$$

where we used Proposition 4.1(1) for the inequality on the third line above, and the following estimate: for any diagram $\mathcal {A}$ and integers $0\le m\le n$

$$\begin{aligned} \mathbf {k}(\mathcal {A}^{n},\mathcal {A}^{m}) \le \mathbf {k}(\mathcal {A}^{n-m},0) = \Vert \textsf {Ent} _{*}(\mathcal {A})\Vert _{1}\cdot (n-m) \end{aligned}$$

The second claim is proven similarly and the third follows from the second and the 1-Lipschitz property of the tensor product:

$$\begin{aligned}&\varvec{\kappa }\big ( (\mathcal {X}\otimes \mathcal {Y})\oplus _{\mathbb {B}_{1/n}}\left\{ \bullet \right\} , (\mathcal {X}\oplus _{\mathbb {B}_{1/n}}\left\{ \bullet \right\} ) \otimes (\mathcal {Y}\oplus _{\mathbb {B}_{1/n}}\left\{ \bullet \right\} ) \big ) \\&\quad \le \varvec{\kappa }\big ( (\mathcal {X}\otimes \mathcal {Y})\oplus _{\mathbb {B}_{1/n}}\left\{ \bullet \right\} , \mathcal {X}\otimes \mathcal {Y}\big ) + 2 \textsf {Ent} (\mathbb {B}_{1/n}) \\&\quad \le 3\textsf {Ent} (\mathbb {B}_{1/n}) \end{aligned}$$

Finally, the fourth follows from Proposition 4.1(2), by slicing both arguments along $\mathbb {B}_{1/n}$. $\square $

4.4 Vanishing defect property and completeness of the tropical cone

Lemma 4.3

For every admissible function $\varphi $, every ${\bar{\mathcal {X}}}\in \textsf {Q}\textsf {L}_{\varphi }(\mathbf {Prob}\left\langle \mathbf{G}\right\rangle ,\varvec{\kappa })$ and every $k \in {\mathbb {N}}$, there exists an asymptotically equivalent sequence ${\bar{\mathcal {Y}}}$ with defect bounded by the admissible function $\varphi _k$ defined by

$$\begin{aligned} \varphi _k(s) {:}{=} 3\textsf {Ent} (\mathbb {B}_{1/k}) + \frac{1}{k} \varphi (k \cdot s) \end{aligned}$$

Proof

Let ${\bar{\mathcal {X}}}=\left\{ \mathcal {X}(i)\right\} $ be a quasi-linear sequence with defect bounded by $\varphi $ and let $k \in {\mathbb {N}}$.

Define a new sequence ${\bar{\mathcal {Y}}}=\left\{ \mathcal {Y}(i)\right\} $ by

$$\begin{aligned} \mathcal {Y}(i) {:}{=} \big (\mathcal {X}(k\cdot i)\big ) \oplus _{\mathbb {B}_{1/k}} \left\{ \bullet \right\} \end{aligned}$$

First we verify that the sequences ${\bar{\mathcal {X}}}$ and ${\bar{\mathcal {Y}}}$ are asymptotically equivalent, that is

$$\begin{aligned} {\hat{\varvec{\kappa }}}({\bar{\mathcal {X}}},{\bar{\mathcal {Y}}})&{:}{=} \lim _{i{\mathop {\rightarrow }\limits ^{}}\infty } \frac{1}{i} \varvec{\kappa }\left( \mathcal {X}(i), \mathcal {Y}(i) \right) = 0 \end{aligned}$$

We estimate the asymptotic distance between individual members of sequences ${\bar{\mathcal {X}}}$ and ${\bar{\mathcal {Y}}}$ using Lemma 4.2 and Corollary 2.5 as follows

$$\begin{aligned} \varvec{\kappa }(\mathcal {X}(i), \mathcal {Y}(i) )&= \varvec{\kappa }\big (\mathcal {X}(i), \mathcal {X}(k\cdot i) \oplus _{\mathbb {B}_{1/k}}\left\{ \bullet \right\} \big ) \\&\le \varvec{\kappa }\left( \mathcal {X}(i), \mathcal {X}(i)^{k} \oplus _{\mathbb {B}_{1/k}}\left\{ \bullet \right\} \right) + \varvec{\kappa }\left( \mathcal {X}(i)^{k} \oplus _{\mathbb {B}_{1/k}}\left\{ \bullet \right\} , \mathcal {X}(k\cdot i) \oplus _{\mathbb {B}_{1/k}}\left\{ \bullet \right\} \right) \\&\le \textsf {Ent} (\mathbb {B}_{1/k}) + D_{\varphi }\cdot \varphi (i) \end{aligned}$$

Thus ${\hat{\varvec{\kappa }}}({\bar{\mathcal {X}}},{\bar{\mathcal {Y}}})=0$ and the two sequences are asymptotically equivalent. Next we show that the sequence ${\bar{\mathcal {Y}}}$ is $\varvec{\kappa }$-quasi-linear and evaluate its defect, also using Lemma 4.2. Let $i,j\in {\mathbb {N}}$, then

$$\begin{aligned}&\varvec{\kappa }\big (\mathcal {Y}(i+j),\mathcal {Y}(i)\otimes \mathcal {Y}(j)\big ) \\&\quad = \varvec{\kappa }\Big ( \mathcal {X}(k\cdot i+k\cdot j) \oplus _{\mathbb {B}_{1/k}}\left\{ \bullet \right\} , \big ( \mathcal {X}(k\cdot i) \oplus _{\mathbb {B}_{1/k}}\left\{ \bullet \right\} \big ) \otimes \big ( \mathcal {X}(k\cdot j) \oplus _{\mathbb {B}_{1/k}}\left\{ \bullet \right\} \big ) \Big ) \\&\quad \le \varvec{\kappa }\Big (\big (\mathcal {X}(k\cdot i)\!\otimes \!\mathcal {X}(k\cdot j)\big ) \!\oplus _{\mathbb {B}_{1/k}}\!\left\{ \bullet \right\} , \big ( \mathcal {X}(k\cdot i) \oplus _{\mathbb {B}_{1/k}}\!\left\{ \bullet \right\} \big ) \!\otimes \! \big ( \mathcal {X}(k\cdot j) \oplus _{\mathbb {B}_{1/k}}\!\left\{ \bullet \right\} \big ) \Big ) \\&\qquad + \frac{1}{k}\varphi \big (k \cdot (i + j )\big ) \\&\quad \le 3\textsf {Ent} (\mathbb {B}_{1/k})+ \frac{1}{k} \varphi \big (k \cdot ( i + j)\big ) \end{aligned}$$

$\square $

Corollary 4.4

For any indexing category $\mathbf{G}$ and for the admissible function $\varphi $ given by $\varphi (t) = t^{\alpha }$, $\alpha \in [0, 1)$, $\textsf {Q}\textsf {L}_{\varphi }(\mathbf {Prob}\left\langle \mathbf{G}\right\rangle ,\varvec{\kappa })$ has the uniformly bounded and vanishing defect properties.

Proof

Let ${\bar{\mathcal {X}}}\in \textsf {Q}\textsf {L}_{\varphi }(\mathbf {Prob}\left\langle \mathbf{G}\right\rangle ,\varvec{\kappa })$. By Lemma 4.3 there exists an asymptotically equivalent sequence ${\bar{\mathcal {Y}}}$ with defect bounded by $\varphi _k$ defined by

$$\begin{aligned} \varphi _k(t)&{:=} 3\textsf {Ent} (\mathbb {B}_{1/k}) + \frac{1}{k} C \varphi (k \cdot t) \\&= 3\textsf {Ent} (\mathbb {B}_{1/k}) + \frac{1}{k} C (k \cdot t)^\alpha \end{aligned}$$

Hence there exists a sequence $c_k {\mathop {\rightarrow }\limits ^{}}0$ such that for all $t \ge 1$,

$$\begin{aligned} \varphi _k(t) \le c_k t^\alpha \end{aligned}$$

showing the uniformly bounded and vanishing defect property. $\square $

4.5 Diagrams of tropical probability spaces

By applying the general setup in the previous section to the metric Abelian monoids $(\mathbf {Prob}\left\langle \mathbf{G}\right\rangle , \otimes , \mathbf {k})$ and $(\mathbf {Prob}\left\langle \mathbf{G}\right\rangle , \otimes , \varvec{\kappa })$ and using the Corollary 4.4 we obtain the following theorem.

Theorem 4.5

Fix an admissible function $\varphi $ and consider the commutative diagram

(4.2)

Then the following statements hold:

1.
The maps are isometries.
2.
The maps are isometric embeddings and each map has a dense image in the corresponding target space.
3.
The space in the lower-right corner, $\big (\textsf {Q}\textsf {L}_{\varphi }(\mathbf {Prob}\left\langle \mathbf{G}\right\rangle ,\varvec{\kappa }), {\hat{\varvec{\kappa }}} \big )$, is complete.

We would like to conjecture that all maps in the diagram above are isometries.

Since $\textsf {Q}\textsf {L}_{\varphi }(\mathbf {Prob}\left\langle \mathbf{G}\right\rangle ,\varvec{\kappa })$ is complete and has $\textsf {L}(\mathbf {Prob}\left\langle \mathbf{G}\right\rangle ,\varvec{\kappa })$ as a dense subset for any $\varphi >0$, it follows that $\textsf {Q}\textsf {L}_{\varphi }(\mathbf {Prob}\left\langle \mathbf{G}\right\rangle ,\varvec{\kappa })$ does not depend (up to isometry of pseudo-metric spaces) on the choice of admissible $\varphi >0$. From now on we will choose the particular function $\varphi (t){:}{=}t^{3/4}$. The choice will be clear when we formulate the Asymptotic Equipartition Property for diagrams. We may finally define the space of tropical$\mathbf{G}$-diagrams, as the space in the lower-right corner of the diagram

$$\begin{aligned} \mathbf {Prob}[\mathbf{G}] {:}{=} \big (\textsf {Q}\textsf {L}_{\varphi }(\mathbf {Prob}\left\langle \mathbf{G}\right\rangle , \varvec{\kappa }),\otimes ,\cdot ,{\hat{\varvec{\kappa }}}\big ) \end{aligned}$$

By Theorem 4.5 above, this space is complete.

The entropy function $\textsf {Ent} _{*}:\mathbf {Prob}\left\langle \mathbf{G}\right\rangle {\mathop {\rightarrow }\limits ^{}}\mathbb {R}^{\llbracket \mathbf{G} \rrbracket }$ extends to a linear functional

$$\begin{aligned} \textsf {Ent} _{*}:\mathbf {Prob}[\mathbf{G}]{\mathop {\rightarrow }\limits ^{}}\left( \mathbb {R}^{\llbracket \mathbf{G} \rrbracket },\Vert \cdot \Vert _{1}\right) \end{aligned}$$

of norm one, defined by

$$\begin{aligned} \textsf {Ent} _{*}({\bar{\mathcal {X}}}) = \lim _{n{\mathop {\rightarrow }\limits ^{}}\infty } \frac{1}{n} \textsf {Ent} _{*} \big (\mathcal {X}(n)\big ) \end{aligned}$$

Applying the construction of Sect. 3 we realize $\mathbf {Prob}[\mathbf{G}]$ as a closed convex cone in some Banach space $\mathbf {Prob}[[\mathbf{G}]]$. Entropy extends to a bounded linear functional $\textsf {Ent} _{*}:\mathbf {Prob}[[\mathbf{G}]]{\mathop {\rightarrow }\limits ^{}}(\mathbb {R}^{\llbracket \mathbf{G} \rrbracket },\Vert \cdot \Vert _{1})$, whose coordinates evaluate non-negatively on the cone. At this point we would like to define an entropic quantity as a bounded linear functional on $\mathbf {Prob}[[\mathbf{G}]]$, which is non-negative on the cone $\mathbf {Prob}[\mathbf{G}]$. Studying such entropic quantities is the subject of our future research.

5 AEP

5.1 Homogeneous diagrams

A $\mathbf{G}$-diagram $\mathcal {X}$ is called homogeneous if the automorphism group $\mathrm {Aut}(\mathcal {X})$ acts transitively on every space in $\mathcal {X}$. Homogeneous probability spaces are uniform. For more complex indexing categories this simple description is not sufficient. The subcategory of all homogeneous $\mathbf{G}$-diagrams will be denoted $\mathbf {Prob}\left\langle \mathbf{G}\right\rangle _{\textsf {h}}$. This space is invariant under the tensor product, thus it is a metric Abelian monoid.

5.1.1 Universal construction of homogeneous diagrams

Examples of homogeneous diagrams could be constructed in the following manner. Fix a finite group G and consider a $\mathbf{G}$-diagram $\left\{ G_{i};\alpha _{ij}\right\} _{i\in \mathbf{G}}$ of (not necessarily normal) subgroups of G, where morphisms $\alpha _{ij}$ are inclusions. The $\mathbf{G}$-diagram of probability spaces $\left\{ X_{i};f_{ij}\right\} $ is constructed by setting $X_{i}=(G/G_{i},\mathsf {unif})$, where $G/G_{i}$ denotes the set of left cosets and $\mathsf {unif}$ is the uniform measure, and taking $f_{ij}$ to be the natural projection $G/G_{i}{\mathop {\rightarrow }\limits ^{}}G/G_{j}$, whenever $G_{i}\subset G_{j}$. The resulting diagram $\mathcal {X}$ will be minimal if and only if for any $i,j\in \mathbf{G}$ there is $k\in \mathbf{G}$, such that $G_{k}=G_{i}\cap G_{j}$. In fact, any complete homogeneous diagram arises this way, according to the following argument from [9], although the representation of homogeneous diagrams by diagrams of subgroups is highly non-unique.

Indeed, let $\mathcal {X}=\left\{ X_{i};\;\chi _{ij}\right\} $ be a homogeneous $\mathbf{G}$-diagram of probability spaces, such that $X_{0}$ is the initial space in $\mathcal {X}$. Then $\mathrm {Aut}(\mathcal {X})$ acts transitively on every space $X_{i}$ in $\mathcal {X}$. Let $x_{0}\in X_{0}$ be an atom and set $x_{i}{:}{=}\chi _{0i}x_{0}$. Define $G{:}{=}\mathrm {Aut}(\mathcal {X})$ to be the full automorphism group and $G_{i}{:}{=}\mathrm {Stab}(x_{i})$ to be the stabilizer of the action of $\mathrm {Aut}(\mathcal {X})$ on $X_{i}$ at point $x_{i}$. The spaces $X_{i}$ can be naturally identified with $G/G_{i}$. Note that $x_{i}$ is the image of $x_{j}$ under the equivariant map $\chi _{ji}$ whenever it is present in the diagram $\mathcal {X}$. Thus we have $G_{j}\subset G_{i}$ and a natural surjection $G/G_{j}{\mathop {\rightarrow }\limits ^{}}G/G_{i}$, if the morphism $\chi _{ji}$ is present in the diagram $\mathcal {X}$. Under the identification $G/G_{j}\cong X_{i}$ the surjection $G/G_{j}{\mathop {\rightarrow }\limits ^{}}G/G_{i}$ coincides with $\chi _{ji}$ due to the equivariance of $\chi _{ji}$.

5.2 Asymptotic equipartition property

In [9] the following theorem is proven.

Theorem 5.1

Suppose $\mathcal {X}\in \mathbf {Prob}\left\langle \mathbf{G}\right\rangle $ is a $\mathbf{G}$-diagram of probability spaces for some fixed complete indexing category $\mathbf{G}$. Then there exists a sequence ${\bar{\mathcal {H}}}=(\mathcal {H}_{n})_{n=0}^{\infty }$ of homogeneous $\mathbf{G}$-diagrams such that

$$\begin{aligned} \frac{1}{n} \mathbf {k}(\mathcal {X}^{\otimes n},\mathcal {H}_{n}) \le C(|X_0|,\llbracket \mathbf{G} \rrbracket ) \cdot \sqrt{\frac{\ln ^3 n}{n}} \end{aligned}$$

(5.1)

where $C(|X_0|, \llbracket \mathbf{G} \rrbracket )$ is a constant only depending on the cardinality $|X_0|$ of the initial space $X_{0}$ of $\mathcal {X}$ and the number $\llbracket \mathbf{G} \rrbracket $ of objects in $\mathbf{G}$.

The Asymptotic Equipartition Property of Theorem 5.1 is a direct generalization of the classical Asymptotic Equipartition Property, which states that if $({\mathsf {X}}_i)$ is a sequence of identically distributed, independent random variables, the random variables $-\frac{1}{n}\log p({\mathsf {X}}_1, \dots , {\mathsf {X}}_n)$ converge as $n{\mathop {\rightarrow }\limits ^{}}\infty $ in probability to the entropy of $X_1$. Indeed, in that case the approximating sequence $H_{n}$ corresponds to a sequence of uniform random variables ${\mathsf {H}}_n$, with $\textsf {Ent} ({\mathsf {H}}_n)/n {\mathop {\rightarrow }\limits ^{}}\textsf {Ent} (X_1)$. Denote by $p(\mathbf{x},\mathbf{h})$ the optimal coupling achieving the distance in left-hand-side of (5.1). Then

$$\begin{aligned}&\frac{1}{n} \int _{X^n} \left| \log p(\mathbf {x}) - \textsf {Ent} ({\mathsf {H}}_n) \right| d p (\mathbf {x}) \\&\quad = \frac{1}{n} \int _{X^n \times H_n} \left| \log \frac{p(\mathbf{x}|\mathbf{h})}{p(\mathbf{h}|\mathbf{x})} \right| d p (\mathbf {x},\mathbf {h})\\&\quad \le \frac{1}{n} \int _{X^n \times H_n} \left| \log p(\mathbf{h}|\mathbf{x}) \right| d p (\mathbf {x},\mathbf {h}) + \frac{1}{n} \int _{X^n \times H_n} \left| \log p(\mathbf{x}|\mathbf{h}) \right| d p (\mathbf {x},\mathbf {h}) \\&\quad \le \frac{1}{n} \mathbf {k}(X^n, H_n) \\&\quad \le C(|X_0|,\llbracket \mathbf{G} \rrbracket ) \cdot \sqrt{\frac{\ln ^3 n}{n}} \end{aligned}$$

which implies the classical Asymptotic Equipartition Property.

We define the space of tropical homogeneous diagrams by

$$\begin{aligned} \mathbf {Prob}[\mathbf{G}]_{\textsf {h}}{:}{=}\textsf {Q}\textsf {L}_{\varphi }(\mathbf {Prob}\left\langle \mathbf{G}\right\rangle _{\textsf {h}},\varvec{\kappa }) \end{aligned}$$

Then, the Asymptotic Equipartition Property can be reformulated as follows.

Theorem 5.2

For any indexing category $\mathbf{G}$ the image of the natural inclusion

$$\begin{aligned} \mathbf {Prob}[\mathbf{G}]_{\textsf {h}}\hookrightarrow \mathbf {Prob}[\mathbf{G}] \end{aligned}$$

is dense.

Proof

By Theorem 5.1, every linear sequence can be approximated by a homogeneous sequence. It follows from the bound (5.1) that the defect of the approximating homogeneous sequence is bounded by a constant times $\varphi $, defined by $\varphi (t)=t^{3/4}$. Moreover, the linear sequences are dense by Theorem 4.5. This finishes the proof. $\square $

6 The tropical cone for probability spaces and chains

Although for general indexing categories $\mathbf{G}$ the space of tropical $\mathbf{G}$-diagrams will typically be infinite dimensional, it has a very simple, finite-dimensional description if $\mathbf{G}$ consists of a single object, or if it is a special type of indexing categories called a chain.

The chain of length k, denoted by $\mathbf{C}_k$, is the indexing category with k objects $O_1, \dots , O_k$, and a morphism from $O_i$ to $O_j$ whenever $i \ge j$. A $\mathbf{C}_k$-diagram of probability spaces is then a chain of reductions

$$\begin{aligned} X_k {\mathop {\rightarrow }\limits ^{}}X_{k-1} {\mathop {\rightarrow }\limits ^{}}\cdots {\mathop {\rightarrow }\limits ^{}}X_{1} \end{aligned}$$

For chains we can describe the tropical cone explicitly.

Theorem 6.1

For $k \in {\mathbb {N}}$, the tropical cone $\mathbf {Prob}[\mathbf{C}_k]$ is isomorphic to the following cone in $(\mathbb {R}^k, |\cdot |_1)$:

In particular, the algebraic structure and the pseudo-distance are preserved under the isomorphism.

Recall that a homogeneous probability space is (isomorphic to) a probability space with a uniform distribution and therefore its isomorphism class is completely determined by its cardinality or entropy. A homogeneous chain has a very simple description as well: A chain is homogeneous if and only if the individual probability spaces are homogeneous, i.e. if and only if the individual probability spaces are (isomorphic to) probability spaces with a uniform measure. Similarly, the isomorphism class of a chain is completely determined by the cardinalities of the spaces contained in it. This allows us to construct a canonical model for any chain.

We denote by $H_{n}$ the homogeneous probability space with the underlying set $\left\{ 0,\dots ,n-1\right\} $ with uniform measure. For $n\,\,|\,\,m$ the map $f_{m,n}:H_{m}{\mathop {\rightarrow }\limits ^{}}H_{n}$ defined by

$$\begin{aligned} f_{m,n}(x){:}{=}\left\lfloor \frac{x\cdot n}{m}\right\rfloor \end{aligned}$$

is a reduction of probability spaces. For a triple of positive integers satisfying $n\,\,|\,\,m\,\,|\,\,l$ holds

$$\begin{aligned} f_{m,n}\circ f_{l,m}=f_{l,n} \end{aligned}$$

(6.1)

If $\mathcal {X}=(X_{k}{\mathop {\rightarrow }\limits ^{}}\cdots {\mathop {\rightarrow }\limits ^{}}X_{1})$ is a homogeneous chain, and $n_{i}{:}{=}|X_{i}|$ then there is an isomorphism

$$\begin{aligned} \mathcal {X}\cong (H_{n_{k}}{\mathop {\longrightarrow }\limits ^{f_{n_{k},n_{k-1}}}}\dots {\mathop {\longrightarrow }\limits ^{f_{n_{2},n_{1}}}} H_{n_{1}}) \end{aligned}$$

(6.2)

Let $N,m,n\in {\mathbb {N}}$ be such that $m\,\,|\,\,N$ and $n\,\,|\,\,N$. Consider a fan of homogeneous spaces

$$\begin{aligned} H_{n}{\mathop {\longleftarrow }\limits ^{f_{N,n}}} H_{N}{\mathop {\longrightarrow }\limits ^{f_{N,m}}} H_{m} \end{aligned}$$

This fan is not minimal (and not homogeneous). We denote its minimization by $\mathcal {Z}_{n,m}$

$$\begin{aligned} \mathcal {Z}_{n,m}=\left( H_{n}{\mathop {\leftarrow }\limits ^{}}Z_{n,m}{\mathop {\rightarrow }\limits ^{}}H_{m}\right) \end{aligned}$$

with the top space $Z_{n,m}$ in the minimization satisfying

$$\begin{aligned} |Z_{n,m}|\le n+m \end{aligned}$$

Also note that minimization does not depend on N, since we assumed it is a multiple of ${\mathsf {lcm}}(m,n)$. Now we can estimate

$$\begin{aligned} \mathrm {kd}(\mathcal {Z}_{n,m})&\le 2\ln (n+m)-\ln (n)-\ln (m) \le 2\ln 2+|\ln n - \ln m|\nonumber \\&= 2\ln 2 + |\textsf {Ent} H_{n}-\textsf {Ent} H_{m}| \end{aligned}$$

(6.3)

Lemma 6.2

Let $\mathcal {X},\mathcal {Y}\in \mathbf {Prob}\left\langle \mathbf{C}_{k}\right\rangle $ be two homogeneous chains of length k. Then

$$\begin{aligned} \mathbf {k}(\mathcal {X},\mathcal {Y})\le 2k\cdot \ln 2+\Vert \textsf {Ent} _*\mathcal {X}-\textsf {Ent} _{*}\mathcal {Y}\Vert _{1} \end{aligned}$$

Proof

Let $(n_{i})$ and $(m_{i})$ be sequences of cardinalities of spaces in $\mathcal {X}$ and $\mathcal {Y}$ respectively. Without loss of generality we may assume that both chains have canonical form provided by (6.2). Let $N{:}{=}{\mathsf {lcm}}(n_{k},m_{k})$. Then $n_{i}$ and $m_{i}$ are divisors of N for all $1\le i\le k$. Consider two-fan of chains

$$\begin{aligned} \mathcal {X}{\mathop {\longleftarrow }\limits ^{l}}\mathcal {H}{\mathop {\longrightarrow }\limits ^{r}}\mathcal {Y}\end{aligned}$$

where $\mathcal {H}=H_{N}^{\mathbf{C}_{k}}$ and $l_{i}=f_{N,n_{i}}$, $r_{i}=f_{N,m_{i}}$ for $1\le i\le k$. Due to transitivity (6.1) this is indeed a two-fan of chains. Its minimization is a chain of minimal fans

$$\begin{aligned} \mathcal {Z}_{i}{:}{=}(X_{i}{\mathop {\leftarrow }\limits ^{}}Z_{n_{i},m_i}{\mathop {\rightarrow }\limits ^{}}Y_{i}) \end{aligned}$$

Thus we can estimate

$$\begin{aligned} \mathbf {k}(\mathcal {X},\mathcal {Y}) \le \sum \mathrm {kd}(\mathcal {Z}_{i}) \le 2k\cdot \ln 2 + \Vert \textsf {Ent} _{*}\mathcal {X}-\textsf {Ent} _{*}\mathcal {Y}\Vert _{1} \end{aligned}$$

$\square $

Corollary 6.3

$$\begin{aligned} \textsf {Ent} _{*}:\mathbf {Prob}[\mathbf{C}_{k}]_{\textsf {h}}{\mathop {\rightarrow }\limits ^{}}(\mathbb {R}^{k},\Vert \cdot \Vert _{1}) \end{aligned}$$

is an isometric embedding.

Proof

Let $[\mathcal {H}_{1}]$ and $[\mathcal {H}_{2}]$ be two tropical chains of length k. Then

$$\begin{aligned} \varvec{\kappa }\big ([\mathcal {H}_{1}],[\mathcal {H}_{2}]\big )&= \lim _{n{\mathop {\rightarrow }\limits ^{}}\infty } \frac{1}{n} \mathbf {k}\big (\mathcal {H}_{1}(n),\mathcal {H}_{2}(n)\big )\\&\le \lim _{n{\mathop {\rightarrow }\limits ^{}}\infty } \frac{1}{n} \big (2k\ln 2 + \Vert \textsf {Ent} _{*}\mathcal {H}_{1}(n)-\textsf {Ent} _{*}\mathcal {H}_{2}(n)\Vert _{1}\big )\\&= \big \Vert \textsf {Ent} _{*}[\mathcal {H}_{1}]-\textsf {Ent} _{*}[\mathcal {H}_{2}]\big \Vert _{1} \end{aligned}$$

The opposite inequality is the 1-Lipschitz property of entropy. $\square $

Proof of Theorem 6.1

The space $\mathbf {Prob}[\mathbf{C}_{k}]_{\textsf {h}}$ is dense in $\mathbf {Prob}[\mathbf{C}_{k}]$ by Theorem 5.2. Therefore the isomorphism in Corollary 6.3 extends to the isomorphic embedding of $\mathbf {Prob}[\mathbf{C}_{k}]$. To prove the surjectivity one constructs chains of probability spaces with prescribed entropies satisfying the inequalities defining the cone in the theorem. This is left to the reader. $\square $

On behalf of all authors, the corresponding author states that there is no conflict of interest.

Notes

The reason for the name tropical cone is the following. For instance in algebraic geometry, tropical varieties are, roughly speaking, divergent sequences of classical varieties, renormalized on a log scale with an increasing base. The adjective ‘tropical’ carries little semantics, but was introduced in honor of the Brazilian mathematician and computer scientist Imre Simon who worked on the subject of tropical mathematics. Analogously, we construct the asymptotic cone from certain divergent sequences with respect to the intrinsic entropy distance. As the intrinsic entropy distance is entropy-based, we achieve a similar type of renormalization as in algebraic geometry.

References

Ay, N., Bertschinger, N., Der, R., Güttler, F., Olbrich, E.: Predictive information and explorative behavior of autonomous robots. Eur. Phys. J. B 63(3), 329–339 (2008)
Article MathSciNet Google Scholar
A’Campo, N.: A natural construction for the real numbers. Mathematics (2003). arXiv:math/0301015
Bertschinger, N., Rauh, J., Olbrich, E., Jost, J., Ay, N.: Quantifying unique information. Entropy 16(4), 2161–2183 (2014)
Article MathSciNet Google Scholar
de Bruijn, N.G., Erdös, P.: Some linear and some quadratic recursion formulas. II. In: Proceedings of the Koninklijke Nederlandse Akademie van Wetenschappen: Series A: Mathematical Sciences, vol. 14, pp. 152–163 (1952)
Friston, K.: The free-energy principle: a rough guide to the brain? Trends Cogn. Sci. 13(7), 293–301 (2009)
Article Google Scholar
Kovačević, M., Stanojević, I., Šenk, V.: On the hardness of entropy minimization and related problems. In: 2012 IEEE Information Theory Workshop, pp. 512–516. IEEE (2012)
Kingma, D.P., Welling, M.: Auto-encoding variational Bayes (2013). arXiv:1312.6114
Matus, F.: Infinitely many information inequalities. In: Information Theory. ISIT 2007. IEEE International Symposium on, pp. 41–44. IEEE (2007)
Matveev, R., Portegies, J.W.: Asymptotic dependency structure of multiple signals. Inf. Geom. 1(2), 237–285 (2018)
Article MathSciNet Google Scholar
Matveev, R., Portegies, J.W.: Conditioning in tropical probability theory (2019). arXiv:1905.05596
Matveev, R., Portegies, J.W.: Tropical probability theory and an application to the entropic cone (2019). arXiv:1905.05351
Rådström, H.: An embedding theorem for spaces of convex sets. Proc. Am. Math. Soc. 3(1), 165–169 (1952)
Article MathSciNet Google Scholar
Steudel, B., Ay, N.: Information-theoretic inference of common ancestors. Entropy 17(4), 2304–2327 (2015)
Article MathSciNet Google Scholar
Van Dijk, S.G., Polani, D.: Informational constraints-driven organization in goal-directed behavior. Adv. Complex Syst. 16(02n03), 1350016 (2013)
Article MathSciNet Google Scholar
Vidyasagar, M.: A metric between probability distributions on finite sets of different cardinalities and applications to order reduction. IEEE Trans. Autom. Control 57(10), 2464–2477 (2012)
Article MathSciNet Google Scholar

Download references

Acknowledgements

Open access funding provided by Projekt DEAL. The authors would like to thank the referee for the many constructive remarks that have led to a substantial improvement of the article.

Author information

Authors and Affiliations

Max-Planck-Institut für Mathematik in den Naturwissenschaften, Inselstraße 22, 04103, Leipzig, Germany
R. Matveev
Eindhoven University of Technology, Postbus 513, 5600 MB, Eindhoven, The Netherlands
J. W. Portegies

Authors

R. Matveev
View author publications
You can also search for this author in PubMed Google Scholar
J. W. Portegies
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to R. Matveev.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Matveev, R., Portegies, J.W. Tropical diagrams of probability spaces. Info. Geo. 3, 61–88 (2020). https://doi.org/10.1007/s41884-020-00027-1

Download citation

Received: 05 June 2019
Revised: 29 January 2020
Published: 07 April 2020
Issue Date: June 2020
DOI: https://doi.org/10.1007/s41884-020-00027-1

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Tropical diagrams of probability spaces

Abstract

Similar content being viewed by others

Tropical Ehrhart theory and tropical volume

Information geometry

Main Directions in the Theory of Probability Metrics

1 Introduction

2 Asymptotic cones of metric abelian monoids

2.1 Metric and pseudo-metric spaces

2.2 Metric abelian monoids

Proposition 2.1

Proposition 2.2

2.3 Asymptotic cones (tropicalization) of monoids

2.3.1 Admissible functions

Lemma 2.3

Proof

2.3.2 Quasi-linear sequences

2.3.3 Asymptotic distance

2.3.4 Quasi-homogeneity

Lemma 2.4

Proof

Corollary 2.5

2.3.5 The semi-module structure

2.3.6 Completeness

Proposition 2.6

Proof

2.3.7 On the density of linear sequences

Proposition 2.7

Proof

2.3.8 Asymptotic distance on original monoid

Lemma 2.8

Proof

3 Grothendieck construction

Proposition 3.1

Proof

4 Tropical probability spaces and their diagrams

4.1 Diagrams of probability spaces

4.2 Tropical diagrams

Proposition 4.1

4.3 Mixtures

4.3.1 Definition of mixtures

4.3.2 The distance estimates for the mixtures

Lemma 4.2

Proof

4.4 Vanishing defect property and completeness of the tropical cone

Lemma 4.3

Proof

Corollary 4.4

Proof

4.5 Diagrams of tropical probability spaces

Theorem 4.5

5 AEP

5.1 Homogeneous diagrams

5.1.1 Universal construction of homogeneous diagrams

5.2 Asymptotic equipartition property

Theorem 5.1

Theorem 5.2

Proof

6 The tropical cone for probability spaces and chains

Theorem 6.1

Lemma 6.2

Proof

Corollary 6.3

Proof

Proof of Theorem 6.1

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation