Abstract
In this paper, we address the problem of constructing a uniform probability measure on \({\mathbb {N}}\). Of course, this is not possible within the bounds of the Kolmogorov axioms, and we have to violate at least one axiom. We define a probability measure as a finitely additive measure assigning probability 1 to the whole space, on a domain which is closed under complements and finite disjoint unions. We introduce and motivate a notion of uniformity which we call weak thinnability, which is strictly stronger than extension of natural density. We construct a weakly thinnable probability measure, and we show that on its domain, which contains sets without natural density, probability is uniquely determined by weak thinnability. In this sense, we can assign uniform probabilities in a canonical way. We generalize this result to uniform probability measures on other metric spaces, including \({\mathbb {R}}^n\).
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction and Main Results
Within the bounds of the Kolmogorov axioms [5], a probability measure on \({\mathbb {N}} = \{1,2,3,\ldots \}\) cannot assign the same probability to every singleton, and therefore, a uniform probability measure on \({\mathbb {N}}\) does not exist. Despite this, we have some intuition about what a uniform probability measure on \({\mathbb {N}} \) should look like. According to this intuition, for example, we would assign probability \(1/2\) to the subset of all odd numbers. If we want to capture this intuition in a mathematical framework, we have to violate at least one of the axioms of Kolmogorov.
One suggestion by De Finetti [3] is to relax countable additivity of the measure to finite additivity. To see why this suggestion is reasonable, we must first understand why it is possible, within the axioms of Kolmogorov, to set up uniform (Lebesgue) measure on \([0,1]\). The type of additivity we demand plays a crucial role here. In the standard theory one always demands countable additivity. If every singleton has the same probability, in an infinite space, every singleton must have probability zero. With countable additivity this means that every countable set must have probability zero. This is no problem if we are working on the uncountable \([0,1]\), since we still have freedom to assign different probabilities to different uncountable subsets of \([0,1]\). The interval \([0,1/2]\), for example, has Lebesgue measure \(1/2\), while it is equipotent with \([0,1]\), which has Lebesgue measure 1. This works because the cardinality of the set over which we sum is smaller than the cardinality of the space itself.
On \({\mathbb {N}}\) the problem of countable additivity is immediately clear: since every subset of \({\mathbb {N}}\) is countable, every subset should have probability zero, which is impossible because the probability of \({\mathbb {N}}\) itself should be 1. In analogy with Lebesgue measure, we want finite subsets to have probability zero, and we want to be able to assign different probabilities to countable subsets. To do this, we should change the type of additivity to finite additivity. In short: since the cardinality of the space changes from uncountable to countable, the additivity should change from countable to finite.
Schirokauer and Kadane [8] study three different collections of finitely additive probability measures on \({\mathbb {N}}\) which may qualify as uniform: the set \(L\) of measures that extend natural density, the set \(S\) of shift-invariant measures and the set \(R\) of measures that measure residue classes uniform. They show that \(N \subset S \subset R\) where the inclusions are strict. If a set \(A \subseteq {\mathbb {N}}\) is without natural density, i.e.,
does not converge as \(n \rightarrow \infty \), different measures in \(L\) assign different probabilities to \(A\). So even the smallest collection discussed by Schirokauer and Kadane does not lead to a uniquely determined uniform probability for sets which do not have a natural density. This observation brings us to the main goal of this paper.
Main goal find a natural notion of uniformity, stronger than extension of natural density such that all probability measures that are uniform under this notion assign the same probability to a large collection of sets. In particular, this collection of sets should be larger than the collection of sets having a natural density.
In this paper, we introduce and study a notion of uniformity which is stronger than the extension of natural density. A uniform probability measure on \([0,1]\) or on a finite space is characterized by the property that if we condition on any suitable subset, the resulting conditional probability measure is again uniform on that subset. It is this property that we will generalize, and the generalized notion will be called weak thinnability. (The actual definition of weak thinnability is given later, and will also involve two technical conditions.)
We allow probability measures to be defined on collections of sets that are closed under complements and finite disjoint unions. This is because we think there is no principal reason to insist that all sets are measured, just like not all subsets of \({\mathbb {R}}\) are Lebesgue measurable. We should, however, be cautious when allowing domains that are not necessarily algebras, for the following reason. De Finetti [3] uses a Dutch Book argument to conclude that, under the Bayesian interpretation of probability, a probability measure has to be coherent. He shows that if the domain of the probability measure is an algebra, the finite additivity of the probability measure implies coherence. On domains only closed under complements and finite disjoint unions, however, this implication no longer holds. Therefore, someone sharing de Fenitti’s view of probability would like to add coherence as additional constraint. For completeness, we study both the case with and the case without coherence as additional constraint on the probability measure.
Definition 1.1
Let \(X\) be a space and write \({\mathcal {P}}(X)\) for the power set of \(X\). An \(f\) -system on \(X\) is a nonempty collection \({\mathcal {F}} \subseteq {\mathcal {P}}(X)\) such that
- 1.:
-
\(A,B \in {\mathcal {F}}\) with \(A \cap B = \emptyset \) implies that \(A \cup B \in {\mathcal {F}}\),
- 2.:
-
\(A \in {\mathcal {F}}\) implies that \(A^c \in {\mathcal {F}}\).
A probability measure on an \(f\)-system \({\mathcal {F}}\) is a map \(\mu : {\mathcal {F}} \rightarrow [0,1]\) such that
- 1.:
-
\(A,B \in {\mathcal {F}}\) with \(A \cap B = \emptyset \) implies that \(\mu (A \cup B) = \mu (A)+\mu (B)\),
- 2.:
-
\(\mu (X)=1\).
A coherent probability measure is a probability measure \(\mu : {\mathcal {F}} \rightarrow [0,1]\) such that for all \(n \in {\mathbb {N}}\), \(\alpha _1,\ldots ,\alpha _n \in {\mathbb {R}}\), \(A_1,\ldots ,A_n \in {\mathcal {F}}\)
A probability pair on \(X\) is a pair \(({\mathcal {F}},\mu )\) such that \({\mathcal {F}}\) is an \(f\)-system on \(X\) and \(\mu \) is a probability measure on \({\mathcal {F}}\).
Remark 1.2
Schurz and Leitgeb [9], p. 261] call an \(f\)-system a pre-Dynkin system, since in case of closure under countable unions of mutually disjoint sets, such a collection is called a Dynkin system.
Remark 1.3
Expression 1.2 has the following interpretation. If \(\alpha _i \ge 0\), we buy a bet on \(A_i\) that pays out \(\alpha _i\) for \(\alpha \mu (A_i)\). If \(\alpha _i<0\), we sell a bet on \(A_i\) that pays out \(|\alpha _i|\) for \(|\alpha _i|\mu (A_i)\). Then (1.2) expresses there is no guaranteed amount of net loss.
We aim at uniquely determining the probability of as many sets as possible. In particular, we are interested in probability pairs with an \(f\)-system consisting only of sets with a uniquely determined probability. So we are not only interested in probability pairs satisfying our stronger notion of uniformity, but in the canonical ones, where “canonical” is to be understood in the following way.
Definition 1.4
Let \(P\) be some collection of probability pairs. A pair \(({\mathcal {F}},\mu ) \in P\) is canonical with respect to \(P\) if for every \(A \in {\mathcal {F}}\) and every pair \(({\mathcal {F}}',\mu ') \in P\) with \(A \in {\mathcal {F}}'\) we have \(\mu (A)=\mu '(A)\).
Before we give a more detailed outline of our paper, we need the following definition. Set
Note that \({\mathcal {M}}\) is an algebra on \([0,\infty )\). It turns out that by working on \([0,\infty )\) instead of \({\mathbb {N}}\), where we restrict ourselves to sub-\(f\)-systems of \({\mathcal {M}}\), we can formulate and prove our claims much more elegantly. Here, we view the elements of \({\mathcal {P}}({\mathbb {N}})\) embedded in \({\mathcal {M}}\) by the injection
We should emphasize, however, that conceptually there is no difference between \([0,\infty )\) and \({\mathbb {N}}\) and that the work we do in Sects. 2 and 3 can be done in the same way for \({\mathbb {N}}\). After working on \({\mathcal {M}}\), we explicitly translate our result to \({\mathbb {N}}\) and other metric spaces in Sect. 4.
For \(A \in {\mathcal {M}}\) we define \(\rho _A: [0,\infty ) \rightarrow [0,1]\) by \(\rho _A(0){:=}0\) and
for \(x>0\). Also set
which are the elements of \({\mathcal {M}}\) that have natural density and let \(\lambda : \mathcal {C} \rightarrow [0,1]\) be given by
We write \(L^{*}\) for the collection of probability pairs \(({\mathcal {F}},\mu )\) on \([0,\infty )\) such that \(\mathcal {C} \subseteq {\mathcal {F}} \subseteq {\mathcal {M}}\) and \(\mu (A)=\lambda (A)\) for \(A \in \mathcal {C}\). Our earlier observation about the indeterminacy of probability under \(L\) gets the following formulation in terms of \(L^{*}\): a pair \(({\mathcal {F}},\mu ) \in L^{*}\) is canonical with respect to \(L^{*}\) if and only if \({\mathcal {F}} = \mathcal {C}\). We write \(WT\) for the collection of probability pairs that are a weakly thinnable pair (WTP), that is, a probability pair that satisfies the condition of weak thinnability. The collection \(WT\) is a proper subset of \(L^{*}\) and contains pairs \(({\mathcal {F}},\mu )\) canonical with respect to \(WT\) such that \({\mathcal {F}} \setminus \mathcal {C} \not = \emptyset \). In other words, with restricting \(L^{*}\) to \(WT\) we are able to assign a uniquely determined probability to some sets without natural density. Finally, we write \(WTC \subseteq WT \subset L^{*}\) for the elements \(({\mathcal {F}},\mu ) \in WT\) such that \(\mu \) is coherent.
The structure of this paper is as follows. In Sect. 2, we discuss weak thinnability and motivate why this is a natural notion of uniformity. In Sect. 3, we introduce the probability pair \(({\mathcal {A}}^\mathrm{uni},\alpha )\) where
and
Remark 1.5
The expression in (1.9) is sometimes called the logarithmic density of \(A\) [11], p. 272].
We end Sect. 3 with the following theorem, which is the main result of our paper.
Theorem 1.6
(Main theorem) The following holds:
-
1
The pair \(({\mathcal {A}}^\mathrm{uni},\alpha )\) is a WTP, is extendable to a WTP \(({\mathcal {F}},\mu )\) with \({\mathcal {F}}={\mathcal {M}}\) and \(\alpha \) is coherent.
-
2
The pair \(({\mathcal {A}}^\mathrm{uni},\alpha )\) is canonical with respect to both \(WT\) and \(WTC\).
-
3
If a pair \(({\mathcal {F}},\mu )\) is canonical with respect to \(WT\) or \(WTC\), then \({\mathcal {F}} \subseteq {\mathcal {A}}^\mathrm{uni}\).
In Sect. 4, we derive from \(({\mathcal {A}}^\mathrm{uni},\alpha )\) analogous probability pairs on certain metric spaces including Euclidean space. The proofs of the results in Sects. 2–4 are given in Sect. 5.
We write \({\mathbb {N}}_0{:=} \{0,1,2,\ldots \}\). For real-valued sequences \(x,y\) or real-valued functions \(x,y\) on \([0,\infty )\) we write \(x \sim y\) or \(x_i \sim y_i\) if \(\lim _{i \rightarrow \infty } (x_i-y_i)=0\). Since we work only on \([0,\infty )\) in Sects. 2 and 3, every time we speak of an \(f\)-system, probability pair or probability measure it is understood that this is on \([0,\infty )\).
2 Weak Thinnability
Let \(m\) be the Lebesgue measure on \({\mathbb {R}}\). For Lebesgue measurable \(Y \subseteq {\mathbb {R}}\) with \(0<m(Y)<\infty \) the uniform probability measure on \(Y\) is given by
for all Lebesgue measurable \(X \subseteq Y\). Let \(A \subseteq B \subseteq C\) be all Lebesgue measurable with \(m(B)>0\) and \(m(C)<\infty \). Observe that
We want to generalize this property to a property of probability pairs on \([0,\infty )\). For \(A \in {\mathcal {M}}\) define \(S_A: [0,\infty ) \rightarrow [0,\infty )\) by
Write
Consider for \(A \in {\mathcal {M}}^{*}\) the map \(f_A: A \rightarrow [0,\infty )\) given by \(f_A(x){:=}S_A(x)\). The map \(f_A\) gives a one-to-one correspondence between \(A\) and \([0,\infty )\). If \(A \in {\mathcal {M}}^{*}\) and \(B \in {\mathcal {M}}\), we want to introduce notation for the set
that gives the subset of \(A\) that corresponds to \(B\) under \(f_A\). Inspired by van Douwen [12], we introduce the following operation.
Definition 2.1
For \(A,B \in {\mathcal {M}}\), define
Note that if \(A,B \in {\mathcal {M}}\), then \(A \circ B \in {\mathcal {M}}\) and that for \(A \in {\mathcal {M}}^{*}\) we have
We can view this operation as thinning \(A\) by \(B\) because we create a subset of \(A\), where \(B\) is “deciding” which parts of \(A\) are removed. We also can view the operation \(A \circ B\) as thinning out \(B\) over \(A\), since we “spread out” the set \(B\) over \(A\). Taking for example
and
we get
and
Let \(({\mathcal {F}},\mu )\) be a probability pair and let \(A \in {\mathcal {F}} \cap {\mathcal {M}}^{*}\). If \(B \in {\mathcal {M}}\), the set \(A\circ B\) is the subset of \(A\) corresponding to \(B\). We can use this to transform \(\mu \) into a measure on \(A\) as follows. We set \({\mathcal {F}}_A {:=} \{ A \circ B \;:\; B \in {\mathcal {F}} \}\) and then define \(\mu _A : {\mathcal {F}}_A \rightarrow [0,1]\) by
Given \(B \in {\mathcal {F}}\) such that \(A \circ B \in {\mathcal {F}}\), the condition that
is a natural generalization of (2.2). Using (2.12) this translates into
We now have the restriction that \(A \in {\mathcal {F}} \cap {\mathcal {M}}^{*}\). However, if \(A \in {\mathcal {F}} \setminus {\mathcal {M}}^{*}\), then any uniform probability measure should assign 0 to \(A\) and since \(A\circ B \subseteq A\) (2.14) still holds. In Sect. 6.2, we show that the condition that (2.14) holds for all \(A,B \in {\mathcal {F}}\) is so strong that only probability pairs with relatively small \(f\)-systems satisfy it. Since it is our goal to find a notion of uniformity that allows for a canonical pair with a large \(f\)-system, we choose to use a weakened version of this property which asks that \(\mu (C \circ A) = \mu (C)\mu (A)\) for every \(C \in \mathcal {C}\) and \(A \in {\mathcal {F}}\).
Weak thinnability also involves two technical conditions. Let \(({\mathcal {F}},\mu )\) be a probability pair, let \(A,B \in {\mathcal {F}}\) and suppose it is true for every \(x \in [0,\infty )\) that
Since this inequality is true for every \(x\), the set \(B\) is “sparser” than \(A\). Therefore, it is natural to ask that \(\mu (A) \ge \mu (B)\). We call this property “preserving ordering by \(S\).”
Since we have \(\mathcal {C} \subseteq {\mathcal {F}}\), it seems natural to also ask \({ \left. \mu \right| _{\mathcal {C}} }=\lambda \), but it turns out to be sufficient to ask the weaker property that \(\mu ([c,\infty ))=1\) for every \(c \in [0,\infty )\). So, to reduce redundancy we require the latter and then prove that \({ \left. \mu \right| _{\mathcal {C}} }=\lambda \). Putting everything together, we obtain the following definition.
Definition 2.2
A probability pair \(({\mathcal {F}},\mu )\) with \({\mathcal {F}} \subseteq {\mathcal {M}}\) is a WTP if it satisfies the following conditions:
-
P1
For every \(C \in \mathcal {C}\) and \(A \in {\mathcal {F}}\) we have \(C \circ A \in {\mathcal {F}}\) and \(\mu (C \circ A)=\mu (C)\mu (A)\),
-
P2
\(\mu \) preserves ordering by \(S\),
-
P3
\(\mu ([c,\infty ))=1\) for every \(c \in [0,\infty )\).
That every WTP extends natural density is implied by the following result.
Proposition 2.3
Let \(({\mathcal {F}},\mu ) \in WT\). Then for \(A \in {\mathcal {F}}\) we have
3 The Pair \(({\mathcal {A}}^\mathrm{uni},\alpha )\)
For \(A \in {\mathcal {M}}\) set \(\sigma _A: (0,\infty )^2 \rightarrow [0,1]\) given by
which is the average of \(1_A\) over the interval \([x,x+D]\). Then set for any \(A \in {\mathcal {M}}\)
and
Define
It is easy to check that \((\mathcal {W}^\mathrm{uni},{ \left. \lambda \right| _{\mathcal {W}^\mathrm{uni}} })\) is a probability pair. For any \(A \in {\mathcal {M}}\), we set
Definition 3.1
We define
and \(\alpha : {\mathcal {A}}^\mathrm{uni} \rightarrow [0,1]\) by
Notice that Definition 3.1 gives a definition of \(({\mathcal {A}}^\mathrm{uni},\alpha )\) that is slightly different from (1.8) and (1.9). For a justification of Eqs. 1.8 and 1.9, see the proof of Lemma 5.2. Our first concern is that \(\alpha \) coincides with natural density.
Proposition 3.2
We have \(\mathcal {C} \subseteq {\mathcal {A}}^\mathrm{uni}\) and for every \(A \in \mathcal {C}\)
A typical example of a set in \({\mathcal {A}}^\mathrm{uni}\) that is not in \(\mathcal {C}\), is
It is easy to check that \(A \not \in \mathcal {C}\), but
so \(\log (A) \in \mathcal {W}^\mathrm{uni}\) with \(\lambda (\log (A))=1/2\). Hence \(A \in {\mathcal {A}}^\mathrm{uni}\) with \(\alpha (A)=1/2\).
That \(({\mathcal {A}}^\mathrm{uni},\alpha )\) is a probability pair follows directly from the fact that \((\mathcal {W}^\mathrm{uni},{ \left. \lambda \right| _{\mathcal {W}^\mathrm{uni}} })\) is a probability pair. The pair \(({\mathcal {A}}^\mathrm{uni},\alpha )\) is also a WTP.
Theorem 3.3
We have \(({\mathcal {A}}^\mathrm{uni},\alpha ) \in WTC \subseteq WT\) and we can extend \(({\mathcal {A}}^\mathrm{uni},\alpha )\) to a \(WTP\) with \({\mathcal {M}}\) as \(f\)-system.
Remark 3.4
We use free ultrafilters in the proof of Theorem 3.3 to show there exists an extension to a \(WTP\) ith \({\mathcal {M}}\) as \(f\)-system. The existence of free ultrafilters is guaranteed by the Boolean Prime Ideal Theorem, which cannot be proven in ZF set theory, but is weaker than the axiom of choice [4]. The existence of a atomfree or nonprincipal (i.e., every singleton has measure zero) finite additive measure defined on the power set of \({\mathbb {N}}\) cannot be established in ZF alone [10]. Consequently, a version of the axiom of choice is always necessary to construct a probability measure on \({\mathcal {M}}\) that assigns measure zero to all bounded intervals.
We do not only want an element of \(WTC\), but a canonical one. This is guaranteed by the following theorem.
Theorem 3.5
The pair \(({\mathcal {A}}^\mathrm{uni},\alpha )\) is canonical with respect to both \(WT\) and \(WTC\).
The pair \(({\mathcal {A}}^\mathrm{uni},\alpha )\) is maximal in the sense that it contains every pair that is canonical with respect to \(WT\) or \(WTC\).
Theorem 3.6
If \(({\mathcal {F}},\mu ) \in WT\) is canonical with respect to \(WT\), then \({\mathcal {F}} \subseteq {\mathcal {A}}^\mathrm{uni}\). If \(({\mathcal {F}},\mu ) \in WTC\) is canonical with respect to \(WTC\), then \({\mathcal {F}} \subseteq {\mathcal {A}}^\mathrm{uni}\).
4 Generalization to Metric Spaces
In this section we derive probability pairs on a class of metric spaces that are analogous to \(({\mathcal {A}}^\mathrm{uni},\alpha )\). Of course one could also try to construct such a probability measure by working more directly on these metric spaces, instead of constructing a derivative of \(({\mathcal {A}}^\mathrm{uni},\alpha )\). Since probability pairs on \([0,\infty )\), motivated from the problem of a uniform probability measure on \({\mathbb {N}}\), is the priority of this paper, we do not make such an effort here.
Let us first sketch the idea of the generalization. Let \(A \in {\mathcal {M}}\). Whether \(A\) is in \({\mathcal {A}}^\mathrm{uni}\) depends completely on the asymptotic behavior of \(\rho _A\) (Lemma 5.2). If \(A \in {\mathcal {A}}^\mathrm{uni}\), then also \(\alpha (A)\) only depends on the asymptotic behavior of \(\rho _A\) (Lemma 5.2). Now suppose that on a space \(X\), we can somehow define a density functions \(\bar{\rho }_B:[0,\infty )\rightarrow [0,1]\) for (some) subsets \(B \subseteq X\) in a canonical way. Then, by replacing \(\rho \) by \(\bar{\rho }\), we get the analogue of \(({\mathcal {A}}^\mathrm{uni},\alpha )\) in \(X\). The goal of this section is to make this idea precise.
Let \((X,d)\) be a metric space. For \(x \in X\) and \(r \ge 0\), write
Write \(\mathcal {B}(X)\) for the Borel \(\sigma \)-algebra of \(X\). We need a “uniform” measure on this space to measure density of subsets in open balls. It is clear that the measure of an open ball should at least be independent of where in the space we look, i.e., it should only depend on the radius of the ball. This leads to the following definition.
Definition 4.1
We say that a Borel measure \(\nu \) on \(X\) is uniform if for all \(r > 0\) and \(x,y \in X\) we have
On \({\mathbb {R}}^n\) with Euclidean metric, the standard Borel measure as obtained by assigning to a product of intervals the product of the lengths of those intervals, is a uniform measure. In general, on normed locally compact vector spaces, the invariant measure with respect to vector addition, as given by the Haar measure, is a uniform measure.
A result by Christensen [1] tells us that uniform measures that are Radon measures are unique up to multiplicative constants on locally compact metric spaces. This, however, does not cover all cases. The set of irrational numbers, for example, is not locally compact, but the Lebesgue measure restricted to Borel sets of irrational numbers is a uniform measure and unique up to a multiplicative constant. We give a slightly more general version of the result of Christensen.
Proposition 4.2
If \(\nu _1\) and \(\nu _2\) are two uniform measures on \(X\), then there exists some \(c>0\) such that \(\nu _1=c \nu _2\).
Proposition 4.2 gives us uniqueness, but not existence. To see that there are metric spaces without a uniform measure, consider the following example. Let \(X\) be the set of vertices in a connected graph that is not regular. Let \(d\) be the graph distance on \(X\). If we suppose that \(\nu \) is a uniform measure on \(X\), from (4.2) with \(r<1\) it follows that for some \(C>0\) we have \(\nu (\{x\})=C\) for every \(x \in X\). But then \(\nu (B(x,2))=C(1+\deg (x))\) for every \(x\in V\), which implies (4.2) cannot hold for \(r=2\) since the graph is not regular. A characterization of metric spaces on which a uniform measure exist, does not seem to be present in the literature.
We now assume \(X\) has a uniform measure \(\nu \) and that \(\nu (X)=\infty \). In addition to that, we write \(h(r){:=} \nu (B(x,r))\) for \(r\ge 0\) and assume that
which is equivalent with amenability in case \((X,d)\) is a normed locally compact vector space [7]. For the importance of this assumption, see Remark 4.4 below.
Set
for \(u \in [0,\infty )\). Note that \(h(r^-(u)) \le u\) and \(h(r^+(u)) \ge u\). Write \((X,\mathcal {L}(X),\bar{\nu })\) for the (Lebesgue) completion of \((X,\mathcal {B}(X),\nu )\). Fix some \(o \in X\). For \(A \in \mathcal {L}(X)\) define the map \(\bar{\rho }_A: [0,\infty ) \rightarrow [0,\infty )\) given by \(\bar{\rho }_A(0){:=}0\) and
for \(r>0\). The value \(\bar{\rho }_A(u)\) is the density of \(A\) in the biggest open ball around \(o\) of at most measure \(u\). Notice that \(\bar{\rho }_A\) is independent of the choice of \(\nu \) as a result of Proposition 4.2. The function \(\bar{\rho }_A\) does depend on the choice of \(o\), but in Proposition 4.3 we show that the asymptotic behavior of \(\bar{\rho }_A\) does not depend on the choice of \(o\). We also show in Proposition 4.3 that the asymptotic behavior of \(\bar{\rho }_A\) is not affected if we replace \(r^-(u)\) by \(r^+(u)\) in (4.5).
Proposition 4.3
Fix \(x,y \in X\) and \(A \in \mathcal {L}(X)\). Then
Remark 4.4
Proposition 4.3 is not necessarily true if we do not assume (4.3), as illustrated by the following example. Suppose \(X\) is the set of vertices of a 3-regular tree graph and \(d\) is the graph distance. Let \(\nu \) be the counting measure, which is a uniform measure on this metric space. Then clearly (4.3) is not satisfied. Now pick any \(x \in X\) and let \(y\) be a neighbor of \(x\). Let \(A \subseteq {\mathcal {P}}(X)\) be the connected component containing \(y\) in the graph where the edge between \(x\) and \(y\) is removed. Then
Proposition 4.3 justifies the use of \(\bar{\rho }\) to determine the density, since its asymptotic behavior is canonical. So, we define for \(A \in \mathcal {L}(X)\) the map \(\bar{\xi }_A: (1,\infty )^2 \rightarrow [0,1]\) given by
Then we set
and \(\alpha ^X: {\mathcal {A}}^\mathrm{uni}(X) \rightarrow [0,1]\) by
The pair \(({\mathcal {A}}^\mathrm{uni}(X),\alpha ^X)\) gives us the analogue of \(({\mathcal {A}}^\mathrm{uni},\alpha )\) in \(X\). In particular, it gives for \(X={\mathbb {N}}\) the corresponding uniform probability measure on \({\mathbb {N}}\) we initially searched for. In case of Euclidean space, we have the following expression for \(({\mathcal {A}}^\mathrm{uni}(X),\alpha ^X)\), which in the special case of \(X={\mathbb {R}}\) gives us an extension of \(\alpha \) (\({\mathcal {A}}^\mathrm{uni}({\mathbb {R}})\) is the maximal sub-\(f\)-system of \(\mathcal {L}({\mathbb {R}})\), where \({\mathcal {A}}^\mathrm{uni} \subseteq {\mathcal {A}}^\mathrm{uni}({\mathbb {R}})\) is the maximal sub-\(f\)-system of \({\mathcal {M}}\)).
Proposition 4.5
Suppose \(X={\mathbb {R}}^n\) and \(d\) is Euclidean distance. Let \(\sigma \) be the surface measure on the unit sphere in \({\mathbb {R}}^n\). Then for \(A \in \mathcal {L}({\mathbb {R}}^n)\) we can replace \(\bar{\xi }_A(D,x)\) in (4.9) and (4.10) by
where \(K_A:[0,\infty ) \rightarrow [0,1]\) is given by
5 Proofs
First we show that every \(f\)-system of a WTP is closed under translation and that every probability measure of a WTP is invariant under translation.
Lemma 5.1
Let \(({\mathcal {F}},\mu )\) be a WTP. Let \(A \in {\mathcal {F}}\) and \(c \in [0,\infty )\). Then
and \(\mu (A)=\mu (A')\).
Proof
Let \(({\mathcal {F}},\mu )\) be a WTP. Let \(A \in {\mathcal {F}}\) and \(c \in [0,\infty )\). Set \(B{:=}[c,\infty )\). We have \(B \in \mathcal {C} \subseteq {\mathcal {F}}\) and by P3 we have \(\mu (B)=1\). Therefore, \(A' = B \circ A \in {\mathcal {F}}\) and
by P1. \(\square \)
Proof of Propositon 2.3
Let \(({\mathcal {F}},\mu )\) be a WTP and \(A \in {\mathcal {F}}\). Set \(u {:=} \limsup _{x \rightarrow \infty } \rho _A(x)\). If \(u=1\) there is nothing to prove, so assume \(u<1\). Let \(\epsilon >0\) be given. Let \(u' \in [0,1] \cap \mathbb {Q}\) such that \(u'>u\) and \(u'-u<\epsilon \). The idea is to construct a \(Y \in {\mathcal {M}}\) such that we can easily see that \(\mu (Y)=u'\) and \(\rho _A(x) \le \rho _Y(x)\) for all \(x\), so that with P2 we get \(\mu (A) \le u'\).
First we observe that there is a \(K>0\) such that for all \(x\ge K\) we have \(\rho _A(x) \le u'\). We can write \(u'\) as \(u' = \frac{p}{q}\) for some \(p,q \in {\mathbb {N}}_0\) with \(p \le q\). Now we introduce the set \(Y\) given by
Note that \(Y \in \mathcal {C} \subseteq {\mathcal {F}}\). Lemma 5.1 and the fact that \(\mu \) is a probability measure, gives us that \(\mu (Y)=u'\). Further, observe that for each \(x \in [0,\infty )\) we have \(\rho _A(x) \le \rho _Y(x)\), so with P2 we get
Letting \(\epsilon \downarrow 0\) we find
By applying this to \(A^c\) we find
\(\square \)
Before we prove Proposition 3.2 and Theorem 3.3, we present the following alternative representation of \(({\mathcal {A}}^\mathrm{uni},\alpha )\). We define for \(A \in {\mathcal {M}}\) the map \(\xi _A: (1,\infty )^2 \rightarrow [0,1]\) given by
Set
If \(s \in \mathcal {S}\) and \(f \in (1,\infty )^{\mathbb {N}}\), then we can interpret the pair \((s,f)\) as the sequence \((s_1,f_1),(s_2,f_2),\ldots \) in \((1,\infty )^2\). Write
for the collection of all such sequences.
For every \((s,f) \in {\mathcal {P}}\) we set
and
Lemma 5.2
(Alternate representation) We have
with for any \((s,f) \in {\mathcal {P}}\) and \(A \in {\mathcal {A}}^\mathrm{uni}\)
Proof
Let \(A \in {\mathcal {M}}\). We start to relate \(\sigma _{\log (A)}\) and \(\xi _A\). If \(D,x \in (1,\infty )\), then
This implies that for \((s,f) \in {\mathcal {P}}\) we have
with for \(A \in {\mathcal {A}}^{s,f}\)
Since for any \(A \in {\mathcal {M}}\) and \((s,f) \in {\mathcal {P}}\)
and
we find that if \(\log (A) \in \mathcal {W}^\mathrm{uni}\), then \(A \in {\mathcal {A}}^{s,f}\) with \(\alpha ^{s,f}(A)=\alpha (A)\).
The only thing left to show is that
So assume \(A \in \bigcap _{(s,f) \in {\mathcal {P}}} {\mathcal {A}}^{s,f}\). Suppose we have \((s,f) \in {\mathcal {P}}\) such that \(\alpha ^{s,f}(A)=L(\log (A))\) and \((s',f') \in {\mathcal {P}}\) such that \(\alpha ^{s',f'}(A)=U(\log (A))\). Then we can create a new sequence given by
Because by assumption \(A \in {\mathcal {A}}^{s'',f''}\), we then have \(\alpha ^{s,f}(A)=\alpha ^{s',f'}(A)\). Hence \(A \in {\mathcal {A}}^\mathrm{uni}\). So it is sufficient to show that we can choose \((s,f)\) and \((s',f')\) in the desired way.
Choose \(s \in \mathcal {S}\) such that
Choose \(f \in (1,\infty )^{{\mathbb {N}}}\) such that
for every \(n \in {\mathbb {N}}\). Then \((s,f) \in {\mathcal {P}}\) with
In the same way choose \((s',f') \in {\mathcal {P}}\) such that
\(\square \)
Proof of Proposition 3.2
Let \(A \in \mathcal {C}\) and \((s,f) \in {\mathcal {P}}\). Since \(\rho _A(y) \rightarrow \lambda (A)\), we have \(\xi _A(s_n,f_n) \sim \lambda (A)\), so \(\alpha ^{s,f}(A)=\lambda (A)\). The result now follows by Lemma 5.2. \(\square \)
Proof of Theorem 3.3
Notice that any intersection of \(f\)-systems closed under weak thinning is again closed under weak thinning. Therefore, if we show that \(({\mathcal {A}}^{s,f},\alpha ^{s,f})\) is a WTP for every \((s,f)\in {\mathcal {P}}\), it follows from Lemma 5.2 that \(({\mathcal {A}}^\mathrm{uni},\alpha )\) is a WTP.
Let \((s,f) \in {\mathcal {P}}\). It immediately follows that \(({\mathcal {A}}^{s,f},\alpha ^{s,f})\) is a probability pair and that P2 and P3 hold, so we have to verify P1. Note that for every \(A,B \in {\mathcal {M}}\) and \(x>0\) we have
Let \(A \in \mathcal {C}\) and \(B \in {\mathcal {A}}^{s,f}\). Then
If \(\lambda (A)=0\) it is clear that \(A \circ B \in {\mathcal {A}}^{s,f}\) with \(\alpha ^{s,f}(A \circ B) = 0 = \lambda (A)\alpha ^{s,f}(B)\). If \(\lambda (A)>0\), then we see that
Since
we have
Thus \(A \circ B \in {\mathcal {A}}^{s,f}\) and since \(\lambda (A)=\alpha ^{s,f}(A)\) (see the proof of Propositon 3.2), we have
We have showed that \(({\mathcal {A}}^\mathrm{uni},\alpha ) \in WT\). To show that \(({\mathcal {A}}^\mathrm{uni},\alpha )\) can be extended, let \({\mathcal {U}}\) be any free ultrafilter on \({\mathbb {N}}\) and \((s,f) \in {\mathcal {P}}\). Then consider \(\mu : {\mathcal {M}} \rightarrow [0,1]\) given by
Since the \({\mathcal {U}}\)-limit is multiplicative it follows completely analogous that \(({\mathcal {M}},\mu )\) is a WTP. Hence every \(({\mathcal {A}}^{s,f},\alpha ^{s,f})\) can be extended to a WTP with \({\mathcal {M}}\) as its \(f\)-system. In particular, by Lemma 5.2, this means that \(({\mathcal {A}}^\mathrm{uni},\alpha )\) can be extended to a WTP with \({\mathcal {M}}\) as its \(f\)-system.
From de Finetti [2] it follows that if \(\alpha \) can be extended to a finitely additive probability measure on an algebra, then \(\alpha \) is coherent. Since we have showed that \(\alpha \) can be extended to \({\mathcal {M}}\), which is an algebra, it follows that \(({\mathcal {A}}^\mathrm{uni},\alpha ) \in WTC\). Notice that we showed that \(\alpha ^{s,f}\) can be extended to \({\mathcal {M}}\) for every \((s,f) \in {\mathcal {P}}\), so we also have \(({\mathcal {A}}^{s,f},\alpha ^{s,f}) \in WTC\) for every \((s,f) \in {\mathcal {P}}\). \(\square \)
For our proof of Theorem 3.5, we need an alternate expression for \(U(\log (A))\). For \(A \in {\mathcal {M}}\) set \(\tau _A: (1,\infty ) \times {\mathbb {N}} \rightarrow [0,1]\) given by
Also set for \(C>1\) and \(A \in {\mathcal {M}}\)
Lemma 5.3
For every \(A \in {\mathcal {M}}\) we have
Proof
Let \(A \in {\mathcal {M}}\) and fix \(C>1\).
Step 1 We show that
where
for \(D,x \in (0,\infty )\).
Define
Since
we have
This implies
Step 2 We give an upper and lower bound for
in terms of \(\tau _A(C,j)\).
If we set for \(j \in {\mathbb {N}}\)
then
We now observe that
and
The fact that \(\log (1+y) \le y\) for every \(y\ge 0\), combined with (5.40), (5.41) and (5.42) gives
Step 3 We combine Step 1 and Step 2 to finish the proof.
Observe that
We use (5.43) and (5.44) to find an upper bound for the expression in (5.37), giving us
Analogously, we find that
Combining (5.45) and (5.46) we obtain
which implies
\(\square \)
We also need the following lemma.
Lemma 5.4
Let \(({\mathcal {F}},\mu )\) be a WTP. Then for any \(A \in {\mathcal {F}}\) and \(C>1\)
Proof
Let \(({\mathcal {F}},\mu )\) be a WTP with \(A \in {\mathcal {F}}\). Fix \(C>1\) and write
The idea is to introduce a set \(B \in {\mathcal {M}}\) for which we have \(\limsup _{x \rightarrow \infty } \rho _B(x) \le CS\) and \(\rho _A(x) \le \rho _B(x)\) for all \(x\). Set
By construction of \(B\) we have \(\rho _A(x) \le \rho _B(x)\) for every \(x \in (0,\infty )\). So
By Proposition 2.3 we then find
\(\square \)
We are ready to give the proof of Theorem 3.5.
Proof of Theorem 3.5
Let \(({\mathcal {F}},\mu )\) be a WTP and \(A \in {\mathcal {F}}\). It is sufficient to show that
We give the following example to give an idea of the proof that follows. Set
Note that \(Z_1,Z_2,Z_3 \in \mathcal {C}\) are pairwise disjoint. Now, we set
Observe that for \(j \ge 3\)
So we constructed a set \(A'\) that on each interval \([2^{j-1},2^{j})\) with \(j \ge 3\) has an average that equals the average of the averages of \(A\) on two consecutive intervals. By weak thinnability we find that \(\mu (A')=\frac{1}{2}\mu (A) + \frac{1}{4}\mu (A) + \frac{1}{4}\mu (A) = \mu (A)\). If \(\tau _{A'}(2,j)\) is convergent or only oscillates a little, we can give a good upper bound of \(\mu (A)\) using Lemma 5.4. Applying this strategy not only for \(C=2\) but for any \(C>1\) and averages of not only two but arbitrarily many averages on consecutive intervals, is what happens in the proof.
Step 1 We construct a \(\hat{A} \in {\mathcal {F}}\).
Fix \(C>1\) and \(n \in {\mathbb {N}}\). We split up \([C^{j-1},C^j)\) into intervals of length 1 plus a remainder interval for every \(j\). Set for \(j \in {\mathbb {N}}\)
and for \(j \in {\mathbb {N}}\) and \(l \in \{1,\ldots ,N_j\}\)
so that for every \(j \in {\mathbb {N}}\) we have
Choose \(u \in {\mathbb {N}}\) such that for every \(j \in {\mathbb {N}}\) we have
which can be done since \(N_j\) is asymptotically equivalent with \(C^{j-1}(C-1)\). For \(p \in \{0,..,n\}\), \(k \in \{1,..,\lfloor C^{p} \rfloor \}\) and \(j \in {\mathbb {N}}\) we set
For \(l \le T\) set
that “evenly” distributes mass \(l\) over the interval \([0,T)\). Note that (5.60) guarantees that
for every \(j \in {\mathbb {N}}\), so
is well defined. Note that by construction \(Z(p,k) \in \mathcal {C}\) and
From this it directly follows that
We now introduce
Observe that all the \(Z(p,k)\) are disjoint. So P1 and the fact that \({\mathcal {F}}\) is an \(f\)-system imply that \(\hat{A} \in {\mathcal {F}}\).
Step 2 We give an upperbound for \(\mu (A)\) by first giving an upperbound for \(\mu (\hat{A})\) and then relating \(\mu (A)\) and \(\mu (\hat{A})\).
A crucial property of \(\hat{A}\) is that for \(j \in {\mathbb {N}}\)
Hence
We apply Lemma 5.4 for \(\hat{A}\) and find with (5.69) that
The weak thinnability of \(\mu \) gives that
Combining (5.70) and (5.71) gives
Step 3 We take limits in (5.72).
Unfix \(n\) and \(C\). We first take the limit superior for \(n \rightarrow \infty \) in (5.72), giving
Then we take the limit superior for \(C \downarrow 1\) and find by Lemma 5.3 that
The lower bound we can now easily obtain by applying our upper bound for the complement of \(A\). Doing this, we see that
giving that \(\mu (A) \ge L(\log (A))\). \(\square \)
Proof of Theorem 3.6
We prove the contrapositive. Let \(({\mathcal {F}},\mu )\) be a WTP with \({\mathcal {F}} \setminus {\mathcal {A}}^\mathrm{uni} \not =\emptyset \). Let \(A \in {\mathcal {F}} \setminus {\mathcal {A}}^\mathrm{uni}\). By Lemma 5.2, this means that there is a \((s,f) \in {\mathcal {P}}\) such that
Clearly, we can find \(m,l \in {\mathbb {N}}^\infty \) such that \(\xi _A(s_{m_n},f_{m_n})\) tends to \(I\) and \(\xi _A(s_{l_n},f_{l_n})\) tends to \(S\). Now set \(s'_n {:=} s_{m_n}\), \(f'_n {:=} f_{m_n}\), \(s''_n {:=} s_{l_n}\) and \(f''_n {:=} f_{l_n}\). Then we see that \(A \in {\mathcal {A}}^{s',f'}\) and \(A \in {\mathcal {A}}^{s'',f''}\) with
In the proof of Theorem 3.3 we showed that \(({\mathcal {A}}^{s',f'},\alpha ^{s',f'})\) and \(({\mathcal {A}}^{s'',f''},\alpha ^{s'',f''})\) are both in WTC. Thus \(({\mathcal {F}},\mu )\) is not canonical with respect to \(WT\) and in case \(\mu \) is coherent, \(({\mathcal {F}},\mu )\) is not canonical with respect to \(WTC\).
\(\square \)
Proof of Proposition 4.2
We give a proof along the lines of Mattila [6], p. 45], with small adaptations for completeness and more generality.
Let \((X,d)\) be a metric space and \(\nu _1,\nu _2\) uniform measures on \(X\). Write \(h_1(r){:=} \nu _1(B(x,r))\) and \(h_2(r){:=} \nu _2(B(x,r))\) for \(r>0\), which are well defined since \(\nu _1\) and \(\nu _2\) are uniform. We show that \(\nu _1=c\nu _2\) for some \(c>0\). It is sufficient to show that \(\nu _1=c\nu _2\) on all open sets.
First let \(A\) be an open set of \((X,d)\) with \(\nu _1(A)<\infty \) and \(\nu _2(A)<\infty \). Suppose that \(r>0\) is such that \(h_2\) is continuous in \(r\). Then
Hence \(x \mapsto \nu _2(A \cap B(x,r))\) is a continuous mapping from \(X\) to \([0,\infty )\). Since \(h_2\) is nondecreasing, it can have at most countable many discontinuities. So we can choose \(r_1,r_2,r_3,\ldots \) such that \(\lim _{n \rightarrow \infty } r_n = 0\) and \(h_2\) is continuous in every \(r_n\).
For \(n \in {\mathbb {N}}\) let \(f_n: X \rightarrow [0,1]\) be given by
Notice that by our previous observation \(f_n\) is continuous on \(A\), hence \(f_n\) is measurable. Because \(A\) is open, we have \(\lim _{n \rightarrow \infty } f_n(x) = 1\) for every \(x \in A\). With Fatou’s Lemma we find
Note that any uniform measure is \(\sigma \)-finite. Applying Fubini’s theorem we obtain
By interchanging \(\nu _1\) and \(\nu _2\) we get
Note that \(\liminf _{n \rightarrow \infty } \frac{h_2(r_n)}{h_1(r_n)}>0\) since (5.80) would otherwise imply that all open balls are null sets. So we may rewrite (5.80) as
Hence \(v_1(A)=cv_2(A)\) with
Now let \(A\) be any open set of \((X,d)\). Let \(x \in X\) and set \(A_n {:=} A \cap B(x,n)\) for \(n \in {\mathbb {N}}\). Note that \(A_n\) is open with \(\nu _1(A_n)\le \nu _1(B(x,n))<\infty \) and \(\nu _2(A_n) \le \nu _2(B(x,n))<\infty \). Hence, by the first part of the proof, we find \(\nu _1(A_n)=c\nu _2(A_n)\). But then
\(\square \)
Proof of Proposition 4.3
Fix \(A \in \mathcal {L}(X)\) and \(x,y \in X\). By (4.3) we have
Hence
Observe that for any \(r \in [0,\infty )\) we have
By (4.3), it follows that
Combining (5.85) and (5.87) gives the desired result. \(\square \)
Proof of Proposition 4.5
Suppose \(X={\mathbb {R}}^n\) with \(d\) Euclidean distance. Set
Let \(\nu \) be the Borel measure on \({\mathbb {R}}^n\). Note that \(h(r) = n^{-1} \delta _n r^n\). If we set \(u=\root n \of {n\delta _n^{-1}y}\), then
Now observe that by partial integration
If we set for \(D,x \in (1,\infty )\)
then
Since \(|\zeta _A(D,x)| \le \frac{1}{\log (D)}\), the desired result follows. \(\square \)
6 Discussion
6.1 Algebra Versus \(f\)-System
The natural analogue of an \(\sigma \)-algebra in finite additive probability theory is an algebra. It has been remarked [9, 13] that the restriction of \({\mathcal {M}}\) to \(\mathcal {C}\) is problematic since \(\mathcal {C}\) is not an algebra. However, any collection extending \(\mathcal {C}\) that is not \({\mathcal {M}}\) itself, is not an algebra since \(a(\mathcal {C})={\mathcal {M}}\). This can be seen as follows. Let \(A \in {\mathcal {M}}\) and set
Then \(M_1,M_2,X,Y \in \mathcal {C}\) with \(\lambda (M_1)=\lambda (M_2)=\lambda (X)=\lambda (Y)=1/2\) and \(A = (M_1 \cap X) \cup (M_2 \cap Y)\). Hence \(A \in a(\mathcal {C})\) and since \(A \in {\mathcal {M}}\) was arbitrary, we have \(a(\mathcal {C})={\mathcal {M}}\).
This observation brings us to the conclusion that the requirement of an algebra, despite the fact that an algebra is the natural analogue of an \(\sigma \)-algebra, is too restrictive. Furthermore, finite additivity only dictates how a probability measure behaves when taking disjoint unions, and thus only suggests closedness under disjoint unions. Coherence is a concern since, as remarked before, it is not guaranteed on \(f\)-systems, whereas it is guaranteed on algebras. Coherence, however, can also be achieved on \(f\)-systems, as \(\alpha \) does, and therefore, coherence not being guaranteed is in itself not an argument against \(f\)-systems. Therefore, we think the requirement of an \(f\)-system rather than an algebra in Definition 1.1 is justified.
It should be noted that even if one prefers \({\mathcal {M}}\) as domain, by Theorem 3.3 \(({\mathcal {A}}^\mathrm{uni},\alpha )\) can be extended to a WTP with \({\mathcal {M}}\) as \(f\)-system. Such a pair is not canonical with respect to \(WT\) or \(WTC\) (Theorem 3.6), but still has \({\mathcal {A}}^\mathrm{uni}\) included as an \(f\)-system within the domain on which probability is uniquely determined.
6.2 Thinnability
Suppose that in Definition 2.2 we replace P1 by the property that for every \(A,B \in {\mathcal {F}}\) we have \(A \circ B \in {\mathcal {F}}\) and \(\mu (A \circ B)=\mu (A)\mu (B)\). Instead of weak thinnability, we call this thinnability. Now consider the set
We have \(A,A^c \in {\mathcal {A}}^\mathrm{uni}\) with \(\alpha (A)=\alpha (A^c)=1/2\). But also, we have \(A\circ A^c \in {\mathcal {A}}^\mathrm{uni}\) with
So \(({\mathcal {A}}^\mathrm{uni},\alpha )\) is not a thinnable pair. Since every thinnable pair is also a WTP, by Theorem 3.5 we see that a thinnable probability measure on \({\mathcal {A}}^\mathrm{uni}\), does not exist.
Notice that we are not necessarily looking for the strongest notion of uniformity, but for a notion that allows for a canonical probability pair with a “big” \(f\)-system. This is the reason why we are interested in weak thinnability rather than thinnability. There may, of course, be other notions of uniformity that lead to canonical pairs with bigger \(f\)-systems than \({\mathcal {A}}^\mathrm{uni}\). At this point, we cannot see any convincing motivation for such notions.
6.3 Weak Thinnability
In this paper, we only studied the notion of weak thinnability from the interest in canonical probability pairs. There are, however, interesting open questions about the property of weak thinnability itself that we did not address in this paper. Some examples are:
-
Is every probability pair that extends \(({\mathcal {A}}^\mathrm{uni},\alpha )\) a WTP?
-
Is every WTP coherent?
-
Can every WTP be extended to a WTP with \({\mathcal {M}}\) as \(f\)-system?
-
How do the sets \(\{ \mu (A) \;:\; ({\mathcal {F}},\mu ) \in WT \;\mathrm{and}\; A \in {\mathcal {F}} \}\) and \(\{ \mu (A) \;:\; ({\mathcal {F}},\mu ) \in WTC \;\mathrm{and}\; A \in {\mathcal {F}} \}\) look like for \(A \not \in {\mathcal {A}}^\mathrm{uni}\)?
-
Is P2 redundant? If no, what probability pairs are not a WTP, but do satisfy P1 and P3?
-
How does weak thinnability relate to the property \(\mu (cA)=\mu (A)\), where \(cA{:=} \{ ca:\,\; a \in A\}\) and \(c>1\)?
6.4 Size of \({\mathcal {A}}^\mathrm{uni}\)
A typical example of a set in \({\mathcal {M}}\) that does not have natural density, but is assigned a probability by \(\alpha \), is
for which we have \(\alpha (A)=1/2\). It is, however, unclear how “many” of such sets there are, i.e., how much “bigger” the \(f\)-system \({\mathcal {A}}^\mathrm{uni}\) is than \(\mathcal {C}\) and how much “smaller” it is than \({\mathcal {M}}\). If we could construct a uniform probability measure on \({\mathcal {M}}\) by the method of Sect. 4, we could determine the probability of \({\mathcal {A}}^\mathrm{uni}\) if \({\mathcal {A}}^\mathrm{uni} \in {\mathcal {A}}^\mathrm{uni}({\mathcal {M}}) \). To construct such a probability measure, we need to equip \({\mathcal {M}}\) with a metric \(d\) such that \(({\mathcal {M}},d)\) has a uniform measure. It is, however, not at all clear how we should choose \(d\). So at this point, it is not clear if there is a useful way of measuring the collections \(\mathcal {C}\) and \({\mathcal {A}}^\mathrm{uni}\).
References
Christensen, J.P.R.: On some measures analogous to Haar measure. Math. Scand. 26, 103–106 (1970)
de Finetti, B.: Sul significato soggettivo della probabilita. Fundam. Math. 17, 298–329 (1931)
de Finetti, B.: Theory of Probability, vols. 1 & 2 (A. Machi & A. Smith, Trans.). Wiley, New York (1974)
Halpern, J.D.: The independence of the axiom of choice from the Boolean prime ideal theorem. Fundam. Math. 55, 57–66 (1964)
Kolmogorov, A.N.: Grundbegriffe der Wahrscheinlichkeitsrechnung. Springer, Berlin (1933)
Mattila, P.: Geometry of Sets and Measures in Euclidean Spaces: Fractals and Rectifiability. Cambridge University Press, Cambridge (1995)
Pier, J.-P.: Amenable Locally Compact Groups. Wiley, New York (1984)
Schirokauer, O., Kadane, J.: Uniform distributions on the natural numbers. J. Theor. Probab. 20, 429–441 (2007)
Schurz, G., Leitgeb, H.: Finitistic and frequentistic approximation of probability measures with or without \(\sigma \)-additivity. Stud. Log. 89(2), 257–283 (2008)
Solovay, R.M.: A model of set theory in which every set of reals is Lebesgue measurable. Ann. Math. 92, 1–56 (1970)
Tenenbaum, G.: Introduction to Analytic and Probabilistic Number Theory. Cambridge University Press, Cambridge (1995)
van Douwen, E.K.: Finitely additive measures on \({\mathbb{N}}\). Topol. Appl. 47(3), 223–268 (1992)
Wenmackers, S., Horsten, L.: Fair infinite lotteries. Synthese 190(1), 37–61 (2013)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Kerkvliet, T., Meester, R. Uniquely Determined Uniform Probability on the Natural Numbers. J Theor Probab 29, 797–825 (2016). https://doi.org/10.1007/s10959-015-0611-2
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10959-015-0611-2