Introduction

Since its introduction by Mayer in the early 1940s, the method of cluster expansions has been, and remains, an important tool in equilibrium statistical mechanics. A classical application yields the analyticity of the logarithm of the partition function for a physical system at equilibrium by deriving a Taylor expansion in the activity or density parameter around zero. Such results can be quite useful, for example, in the study of phase transitions or of the decay of correlations, for a vast class of models.

In 1971, Gruber and Kunz introduced in their seminal paper [12] systems of non-overlapping geometric objects, referred to as polymers, given by subsets of a lattice. They presented a rigorous mathematical formalism in order to provide convergent cluster expansions for this model. Instead of the logarithm of the partition function, they considered the correlation functions of the system and derived convergent activity expansions by using a system of integral equations, the so-called Kirkwood–Salsburg equations, and solving the corresponding fixed-point equation on a suitable Banach space. In the following years, however, less analytical approaches were favoured by researchers: combinatorial proofs such as in [3], relying on tree-graph identities [23], and inductive proofs following the idea of Kotecký and Preiss [18] and its development by Dobrushin in [4]. The inductive method was presented in the more general setup of abstract polymers (where the underlying space is not necessarily a lattice, nor are the polymers necessarily given by geometric objects). Notice that abstract polymer models are universal in the sense that a large class of classical models can be represented as polymer models due to the combinatorial structure of the corresponding partition functions (see, e.g., [11] for an application to the Ising model). Moreover, an interesting connection with probability theory was pointed out by Scott and Sokal in [29]: convergence of cluster expansions in abstract polymer models is related to the Lovász Local Lemma, and better sufficient conditions can provide refinements of the latter (see, e.g., [2]).

In 2008, Fernández and Procacci proved a new sufficient criterion in the setup of abstract polymers, improving on the result by Kotecký and Preiss. The original proof [8] relies on combinatorial arguments; an alternative proof via an induction à la Dobrushin appeared recently [10]. Finally, in this paper we provide an analytical proof in the spirit of Gruber–Kunz.

Overall, in the last two decades, a notable effort has been made to generalize the classical sufficient conditions in the abstract polymer setup (including the condition by Fernández and Procacci) to continuous spaces and to systems with soft-core (or even more general) interactions; see [5, 14, 22, 24, 32].

We want to go further by employing a Kirkwood–Salsburg approach in the rather general setup of Gibbs point processes (or, in terms of statistical mechanics, grand-canonical Gibbs measures) defined via pairwise interactions. It is well known that, under mild additional moment conditions which are automatically satisfied for non-negative pair potentials, there is a one-to-one correspondence between the set of those Gibbs measures and the associated families of correlation functions (also known as factorial moment densities). In the special case of a discrete space and hard-core interactions, the value of the n-point correlation function is simply the probability of seeing n particles at the prescribed positions in the random configuration of particles. The correlation functions can be expanded as power series in the activity parameter z, i.e., in the intensity of the underlying Poisson process. We denote the Taylor expansion of the n-point correlation function in z around zero by \(\rho _n\) and write \(\varvec{\rho }\) for the family of those expansions. In general, the series \(\varvec{\rho }\) need not be convergent at all; we are, however, interested in conditions which ensure pointwise convergence (towards the correlation functions). Furthermore, we want to consider the more general case where the underlying Poisson point process is inhomogeneous, i.e., where a different intensity value may be assigned to every point of the space; the activity z is then a function and the expansions \(\varvec{\rho }\) are multivariate power series in z. For a rigorous introduction to Gibbs point processes and the corresponding correlation functions, see [14], but notice that here we do not assume the interaction potential to be non-negative (unless explicitly stated).

The starting point of the paper and the central quantity to investigate are the activity expansions \(\varvec{\rho }\), which we consider independently of their interpretation in terms of the correlation functions. Let us outline the main ideas present in the paper. The coefficients of the multivariate power series \(\varvec{\rho }\) are defined in terms of a certain family of rooted graphs to which we refer as multi-rooted graphs (see [14, 30]). Using the terminology from [6], the activity expansions \(\varvec{\rho }\) are given by the exponential generating functions of the coloured weighted combinatorial species of multi-rooted graphs with a fixed set of roots. The set of all multi-rooted graphs has an essential structural property: it is invariant under the removal of a root. Taking a multi-rooted graph and removing an arbitrary root (as well as all edges incident to it), one again obtains a multi-rooted graph on a smaller vertex set, in which every neighbour of the removed root becomes a root vertex itself. The weight of the original graph is equal to the weight of the resulting graph times the weight of the removed edges. The corresponding property of the generating functions is expressed by the Kirkwood–Salsburg equations. Every possible rule for the choice of the root to remove induces a different combinatorial operation and therefore a different system of Kirkwood–Salsburg equations for the generating functions.

In this work we provide a condition for absolute convergence of the activity expansions \(\varvec{\rho }\) in terms of the existence of a measurable function solving a system of Kirkwood–Salsburg type inequalities (in the case of repulsive interactions, that condition is also a necessary one). Our main result, Theorem 2.1, is inspired by [1]; it is a slightly modified, substantially generalized version of Claim 1 therein. The goal, however, is not only to obtain abstract conditions which are both necessary and sufficient for convergence of the cluster expansions, but also to demonstrate how these characterizations provide a universal approach to proving model-specific sufficient conditions on different levels of generality, both in discrete and continuous setups with repulsive interactions. A two-way mechanism arises: On the one hand, for a candidate family of ansatz functions \(\varvec{\xi }\) (given, for example, as approximations of \(\varvec{\rho }\)) one can search for conditions that ensure that these functions \(\varvec{\xi }\) satisfy the Kirkwood–Salsburg inequalities; on the other hand, given candidate sufficient conditions, one can construct a suitable family of ansatz functions \(\varvec{\xi }\) tailored to satisfy the Kirkwood–Salsburg inequalities under these conditions.

This approach provides a unifying framework for the known conditions, but it also makes it possible to prove stronger results. To emphasize this possibility, we derive a new sufficient condition for absolute convergence of the activity expansions \(\varvec{\rho }\) in the setup of abstract polymers. In that general setup, our condition improves on every condition we are aware of.

A more detailed outline of the main ideas introduced above can be found in [16] (for the case of non-negative pairwise interactions and without rigorous proofs).

In the further course of the paper, we investigate two particular hard-core setups as examples: subset polymers in \(\mathbb {Z}^d\) and hard objects in \(\mathbb {R}^d\). There, the sets of roots of the multi-rooted graphs correspond to configurations of geometric objects. By breaking the geometric objects into smaller “pieces”, to which we refer as snippets, we can identify these configurations with configurations of snippets (e.g., in the case of subset polymers we can identify a configuration of polymers with the disjoint union of monomers covering this configuration). Picking a root of a multi-rooted graph (the combinatorial operation underlying the Kirkwood–Salsburg equations) corresponds to picking a snippet. Different rules to pick a snippet give rise, in general, to characterizations of absolute convergence in terms of different Kirkwood–Salsburg inequalities; this way, the latter can be tailored to a candidate sufficient condition. Thus different sufficient conditions can be derived by varying both the system of Kirkwood–Salsburg inequalities and the ansatz functions satisfying these inequalities. We illustrate this mechanism by deriving some sufficient conditions for a class of hard-core interaction models, in particular for multi-type systems of hard spheres in \(\mathbb {R}^d\).

The paper is organized as follows: In Sect. 2.1 we introduce the basic notation and present the general framework. Furthermore, in Theorem 2.1 we state our main result, a characterization of the domain of absolute convergence for the activity expansions \(\varvec{\rho }\), and use it to recover the classical sufficient conditions by Kotecký and Preiss as well as by Fernández and Procacci in a rather general setup (see Corollaries 2.2 and 2.3, respectively). In Sect. 2.2, the same approach is used to prove a new, improved sufficient condition in the setup of abstract polymers (Proposition 2.4). The proof of the proposition relies on an auxiliary result (Lemma 2.5) which is proved in Appendix A. In Sects. 2.3 and 2.4 we consider the special case of hard-core interactions. Both in the continuum (Sect. 2.3) and in the discrete setup (Sect. 2.4), we provide model-specific characterizations of the convergence domain, stated in Theorems 2.6 and 2.7, respectively. As an immediate consequence of Theorem 2.7 we obtain an elementary proof of the well-known Gruber–Kunz condition (Corollary 2.8). In Sect. 3, we present a forest-graph equality and other combinatorial results in order to prove Theorems 2.1, 2.6 and 2.7. Finally, in Sect. 4, Theorems 2.6 and 2.7 are used to obtain practitioner-type sufficient conditions for a class of hard-core interaction models, including new sufficient conditions for subset polymers in \(\mathbb {Z}^d\) (Theorem 4.1) and hard objects in \(\mathbb {R}^d\) (Theorems 4.2 and 4.4).

The reader interested primarily in the discrete setup of subset polymers is encouraged to jump directly to Subsect. 2.4, its main result being the characterization of the domain of convergence for the activity expansions \(\varvec{\rho }\) given by Theorem 2.7 (compare to Theorem 3.13). The main ideas behind the proof of Theorem 2.7 in Subsect. 3.4 and behind the application of Theorem 2.7 in Subsect. 4.1 can be transferred to the continuous setup as well.

Main Results

(Locally) Stable Pair Potentials

Let \((\mathbb X, {\mathcal {X}})\) be a measurable space, \(\lambda \) a \(\sigma \)-finite reference measure on it, and v a pair potential, i.e., a measurable map \(v: \mathbb X\times \mathbb X\rightarrow \mathbb {R}\cup \{\infty \}\) that is symmetric in the sense that \(v(x,y) = v(y,x)\) for all \(x,y\in \mathbb {X}\). Corresponding to the potential v, Mayer’s f function is given by

$$\begin{aligned} f(x,y) = \mathrm {e}^{-v(x,y)}-1. \end{aligned}$$

We call the pair potential v stable if there exists a measurable map \(B:\mathbb X\rightarrow \mathbb {R}_+\) such that for any \(n\in \mathbb {N}\) and \(x_1,\ldots ,x_n\in \mathbb {X}\)

$$\begin{aligned} \prod \limits _{1\le i<j\le n}(1+f(x_i,x_j))\le \mathrm {e}^{\sum _{k=1}^nB(x_k)} \end{aligned}$$
(2.1)

holds; we call v locally stable or Penrose stable (due to O. Penrose, see [23]) if there exists a measurable map \(C:\mathbb X\rightarrow \mathbb {R}_+\) such that for any \(x_0\in \mathbb {X}\), \(n\in \mathbb {N}\) and \(x_1,\dots ,x_n\in \mathbb {X}\) satisfying \(\prod _{1\le i<j\le n}(1+f(x_i,x_j))\ne 0\)

$$\begin{aligned} \prod \limits _{i=1}^n(1+f(x_0,x_i))\le \mathrm {e}^{C(x_0)} \end{aligned}$$
(2.2)

holds. Notice that every locally stable potential is stable and that every non-negative potential v is locally stable (with the choice \(C\equiv 0\)).

An activity function is a measurable map \(z:\mathbb X\rightarrow \mathbb {R}\). Physically relevant activities are non-negative but for the purpose of studying the convergence of expansions it can be helpful to admit negative (or complex) activities as well. We define the (signed) measure \(\lambda _z\) on \({\mathcal {X}}\) by

$$\begin{aligned} \lambda _z(B):= \int _B z(x) \lambda (\mathrm {d}x), \quad B\in {\mathcal {X}}. \end{aligned}$$
(2.3)

The weight of a graph G with vertex set \([n]= \{1,\ldots ,n\}\) and edge set E(G) is

$$\begin{aligned} w(G;x_1,\ldots ,x_n) := \prod _{\{i,j\}\in E(G)} f(x_i,x_j). \end{aligned}$$

Let \({\mathcal {G}}_n\) be the set of all graphs with vertex set [n], \({\mathcal {C}}_n\subset {\mathcal {G}}_n\) the set of connected graphs and

$$\begin{aligned} \varphi _n^{\mathsf {T}}(x_1,\ldots ,x_n):= \sum _{G \in {\mathcal {C}}_{n}} w(G;x_1,\ldots ,x_n) \end{aligned}$$

the n-th Ursell function. For \(n\in \mathbb {N}\) and \(k\in \mathbb {N}_0\), let \({\mathcal {D}}_{n,n+k}\subset {\mathcal {G}}_{n+k}\) be the collection of all graphs G such that every vertex \(j\in \{n+1,\ldots , n+k\}\) connects to at least one of the vertices \(i\in \{1,\ldots , n\}\). We may view the vertices \(\{1,\ldots , n\}\) as roots and call the graphs \(G\in {\mathcal {D}}_{n,n+k}\) multi-rooted graphs or, following the footnote 53 in [30], root-connected graphs. Consider the functions

$$\begin{aligned} \psi _{n,n+k}(x_1,\ldots ,x_{n+k}) := \sum _{G \in {\mathcal {D}}_{n,n+k}} w(G;x_1,\ldots ,x_{n+k}). \end{aligned}$$

For \(n=1\), the functions coincide with the standard Ursell functions, i.e., \(\psi _{1,1+k} = \varphi _{1+k}^{\mathsf {T}}\). We are interested in the associated series

$$\begin{aligned} \rho _n(x_1,\ldots ,x_n;z) :=\sum _{k=0}^\infty \frac{1}{k!} \int _{\mathbb X^k} \psi _{n,n+k}(x_1,\ldots ,x_n,y_1,\ldots ,y_k) z(x_1)\cdots z(x_n) \lambda _z^k(\mathrm {d}\varvec{y}). \end{aligned}$$

The summand for \(k=0\) is to be read as \(\psi _{n,n}(x_1,\ldots ,x_n) z(x_1)\cdots z(x_n)\). The series \(\rho _n\) corresponds to the n-point correlation function of a grand-canonical Gibbs measure [30, Eqs. (4–7)], see also [14]—it is the expansion of the correlation function in the activity z around 0.
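As a quick sanity check on these definitions, one can enumerate small graphs by brute force and count the root-connected ones. The script below is our own illustration, not part of the paper; it confirms that for \(n=1\) the class \({\mathcal {D}}_{1,1+k}\) consists exactly of the connected graphs, in line with \(\psi _{1,1+k} = \varphi _{1+k}^{\mathsf {T}}\).

```python
def count_root_connected(n, k):
    """Count graphs on n+k labeled vertices in which every non-root
    vertex is joined by a path to at least one of the n roots
    (vertices 0,...,n-1 here); this is |D_{n,n+k}|."""
    V = n + k
    edges = [(i, j) for i in range(V) for j in range(i + 1, V)]
    total = 0
    for mask in range(1 << len(edges)):
        parent = list(range(V))

        def find(a):
            # union-find with path halving
            while parent[a] != a:
                parent[a] = parent[parent[a]]
                a = parent[a]
            return a

        for idx, (i, j) in enumerate(edges):
            if (mask >> idx) & 1:
                parent[find(i)] = find(j)
        root_components = {find(i) for i in range(n)}
        if all(find(v) in root_components for v in range(n, V)):
            total += 1
    return total

# n = 1: root-connectedness is ordinary connectedness.
print(count_root_connected(1, 2))  # 4 connected graphs on 3 vertices
print(count_root_connected(1, 3))  # 38 connected graphs on 4 vertices
# n = 2, k = 1: 6 of the 8 graphs on 3 vertices are root-connected.
print(count_root_connected(2, 1))  # 6
```

The counts 4 and 38 are the numbers of connected labeled graphs on 3 and 4 vertices, as expected from \(\psi _{1,1+k} = \varphi _{1+k}^{\mathsf {T}}\).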

We will say that the activity expansions \(\varvec{\rho }\) converge absolutely for a non-negative activity function z if

$$\begin{aligned} \sum _{k=0}^\infty \frac{1}{k!} \int _{\mathbb X^k}\vert \psi _{n,n+k}(x_1,\ldots ,x_n,y_1,\ldots ,y_k)\vert z(x_1)\cdots z(x_n) \lambda _z^k(\mathrm {d}\varvec{y})<\infty \end{aligned}$$

for all \(n\in \mathbb {N}\) and \((x_1,\ldots ,x_n)\in \mathbb {X}^n\).

Our main concern is to derive necessary and sufficient convergence conditions, but sometimes it is useful to view the series as purely formal; relevant background on formal power series whose variable is a measure (here \(\lambda _z(\mathrm {d}x)\)) is given in [17, Appendix A].

Next we introduce sign-flipped Kirkwood–Salsburg operators. A selection rule \(s(\cdot )\) is a map from \(P(\mathbb X):= \sqcup _{n=1}^\infty \mathbb X^n\) to \(\mathbb {N}\) such that \(s(x_1,\ldots ,x_n) \in \{1,\ldots , n\}\) for all \((x_1,\ldots , x_n) \in P(\mathbb X)\). To lighten notation we write \(x_s\) rather than \(x_{s(x_1,\ldots ,x_n)}\). Further let \((x'_2,\ldots , x'_{n})\) be the vector obtained from \((x_1,\ldots ,x_n)\) by deleting the entry \(x_s\), leaving the order otherwise unchanged. For the simplest selection rule that picks the first entry \(s =1\), we have \(x'_i = x_i\). The sign-flipped Kirkwood–Salsburg operator \({\tilde{K}}_z^s\) with selection rule \(s(\cdot )\) acts on families \(\varvec{\xi }= (\xi _n)_{n\in \mathbb {N}}\) of measurable symmetric functions \(\xi _n:\mathbb X^n\rightarrow \mathbb {R}_+\) as

$$\begin{aligned} ({\tilde{K}}_z^s \xi )_n(x_1,\ldots ,x_n) :={}& z(x_s)\, \prod \limits _{i=2}^n(1+f(x_s,x'_i)) \Big ( \mathbb {1}_{\{n\ge 2\}}\xi _{n-1}(x'_2,\ldots , x'_{n}) \\ &+ \sum _{k=1}^\infty \frac{1}{k!} \int _{\mathbb X^k} \prod _{j=1}^k \bigl | f(x_s, y_j)\bigr |\, \xi _{n-1+k} (x'_2,\ldots , x'_{n}, y_1,\ldots , y_k) \lambda ^k(\mathrm {d}\varvec{y})\Big ), \end{aligned}$$
(2.4)

for all \(n\in \mathbb {N}\) and \((x_1,\ldots ,x_n)\in \mathbb X^n\). Here we allow the functions \(({\tilde{K}}_z^s \xi )_n\) to assume the value “\(\infty \)”. For non-negative potentials and on a suitably reduced domain, \({\tilde{K}}_z^s\) differs from the standard Kirkwood–Salsburg operator [26, Chapter 4.2] by a mere sign-flip: it has \(|f(x_s,y_i)|\) instead of \(f(x_s,y_i)\).

Theorem 2.1

Let \(z(\cdot )\) be a non-negative activity and \(s(\cdot )\) any selection rule. Consider the following two conditions:

  1. (i)

    There is a family \(\varvec{\xi }= (\xi _n)_{n\in \mathbb {N}}\) of measurable symmetric functions \(\xi _n:\mathbb X^n\rightarrow \mathbb {R}_+\) such that

    $$\begin{aligned} z(x_1) \delta _{n,1} + ({\tilde{K}}_z^s \varvec{\xi })_n\, (x_1,\ldots ,x_n) \le \xi _n(x_1,\ldots ,x_n) \end{aligned}$$
    (2.5)

    for all \(n\in \mathbb {N}\) and \((x_1,\ldots ,x_{n})\in \mathbb X^{n}\).

  2. (ii)

    The series \(\rho _n(x_1,\ldots , x_n;z)\) converges absolutely, for all \(n\in \mathbb {N}\) and \((x_1,\ldots , x_n)\in \mathbb X^n\).

Condition (i) is sufficient for (ii) to hold; moreover, if (i) is satisfied, then

$$\begin{aligned} \sum _{k=0}^\infty \frac{1}{k!} \int _{\mathbb X^k} \bigl | \psi _{n,n+k}(x_1,\ldots ,x_n,y_1,\ldots ,y_k) \bigr | z(x_1)\cdots z(x_n) \lambda _z^k(\mathrm {d}\varvec{y}) \le \xi _n(x_1,\ldots , x_n) \end{aligned}$$
(2.6)

on \(\mathbb X^n\), for all \(n\in \mathbb {N}\).

In addition, if we assume the pair potential to be non-negative, then (ii) implies (i) as well, so that the two conditions are equivalent in this case.

Remark 2.1

We formulate this theorem, as well as the following results, for non-negative activities, mainly for notational convenience. Naturally, such conditions for absolute convergence can be formulated in the usual framework of complex analysis by replacing complex activities z with \(\vert z \vert \) in the convergence criteria.

We prove the theorem in Subsect. 3.2. The known sufficient convergence conditions of Kotecký–Preiss and Fernández–Procacci types are easily recovered from Theorem 2.1. We start with the Kotecký–Preiss type criterion [18], as extended to soft-core and continuum systems by Ueltschi in [32] (and to stable interactions by Ueltschi and Poghosyan in [24]).

Corollary 2.2

Let z be a non-negative activity function and assume stable interactions in the sense of (2.1) for some \(B\ge 0\). If there exists a measurable function \(a: \mathbb {X}\rightarrow \mathbb {R}_+\) such that for all \(x\in \mathbb {X}\)

$$\begin{aligned} \int _\mathbb X \bigl |f(x,y)\bigr | \mathrm {e}^{a(y)} \lambda _z(\mathrm {d}y)+2B(x) \le a(x), \end{aligned}$$
(2.7)

then the activity expansions \(\rho _n(x_1,\ldots ,x_n;z)\) converge absolutely and the bounds

$$\begin{aligned} \rho _n(x_1,\ldots ,x_n;z)\le z(x_1)\cdots z(x_n) \mathrm {e}^{a(x_1)+\cdots + a(x_n)} \end{aligned}$$

hold for all \(n\in \mathbb {N}\) and \((x_1,\ldots , x_n)\in \mathbb X^n\). Notice that for non-negative pair interactions, we can choose \(B\equiv 0\) in condition (2.7).

Remark 2.2

Notice that via the substitution \(\hat{a}=a-2B\) the above criterion is equivalent to the existence of a measurable function \({\hat{a}}: \mathbb {X}\rightarrow \mathbb {R}_+\) such that for all \(x\in \mathbb X\)

$$\begin{aligned} \int _\mathbb X \bigl |f(x,y)\bigr | \mathrm {e}^{{\hat{a}}(y)+2B(y)} \lambda _z(\mathrm {d}y) \le {\hat{a}}(x). \end{aligned}$$

Proof

Assume that (2.7) holds and define \(\varvec{\xi }=(\xi _n)_{n\in \mathbb {N}},\ \xi _n:\mathbb {X}^n\rightarrow [0, \infty )\), by

$$\begin{aligned} \xi _n(x_1,...,x_n):=z(x_1)\cdots z(x_n) \mathrm {e}^{a(x_1)+\cdots + a(x_n)} \end{aligned}$$

for some \(a(\cdot )\) satisfying (2.7). The interactions fulfill the stability condition (2.1); therefore, for every \(n\in \mathbb {N}\) and \(x_1,...,x_n\in \mathbb {X}\) there exists an index \(j\in \{1,\ldots ,n\}\) such that the bound

$$\begin{aligned} \prod \limits _{ 1\le i\le n,i\ne j} (1+f(x_j,x_i))\le \mathrm {e}^{2B(x_j)} \end{aligned}$$
(2.8)

holds. Choose the selection rule s that always picks an element \(x_j\) satisfying (2.8) from \((x_1,\ldots ,x_n)\). Plugging our choice of \(\varvec{\xi }\) into the left-hand side of Eq. (2.5) and bounding the interaction term as \(\prod _{i=2}^n(1+f(x_s,x'_i))\le \mathrm {e}^{2B(x_s)}\), we recognize an exponential series, and find altogether that the left-hand side of (2.5) is bounded by

$$\begin{aligned} z(x_s)z(x'_2)\cdots z(x'_n)\, \mathrm {e}^{a(x'_2)+\cdots + a(x'_n)} \exp \Bigl ( \int _\mathbb X |f(x_s,y)| \mathrm {e}^{a(y)} \lambda _z(\mathrm {d}y)+2B(x_s) \Bigr ). \end{aligned}$$

By condition (2.7), this is in turn bounded by \(\xi _{n}(x_1,\ldots , x_n)\). It follows that condition (i) of Theorem 2.1 is satisfied. \(\square \)
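To get a feel for condition (2.7), consider the simplest homogeneous situation (our own illustrative specialization, not taken from the paper): a translation-invariant hard-core polymer system with \(B\equiv 0\), constant activity z and \(|\Gamma (x)| = N\), so that \(|f(x,y)| = \mathbb {1}_{\{y\in \Gamma (x)\}}\). With a constant ansatz \(a(x)\equiv a\), condition (2.7) reads \(zN\mathrm {e}^{a}\le a\); optimizing over a gives the classical bound \(z\le 1/(\mathrm {e}N)\), attained at \(a=1\).

```python
import math

def kp_bound(N):
    """Best constant-ansatz activity allowed by (2.7) in the homogeneous
    hard-core case z * N * exp(a) <= a: maximize a * exp(-a) / N over a."""
    a_grid = [i / 10000 for i in range(1, 50001)]
    return max(a * math.exp(-a) / N for a in a_grid)

# Each 2x2 square in Z^2 overlaps N = 9 translates (itself included):
bound9 = kp_bound(9)
print(bound9)            # ~ 0.040875, i.e. 1/(9e)
print(1 / (9 * math.e))
```

For the 2x2 squares treated in Example 2.1 below, this gives \(z \le 1/(9\mathrm {e}) \approx 0.0409\), noticeably weaker than the Fernández–Procacci value 0.057271 quoted there.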

Analogously, one shows that the criterion by Fernández and Procacci [8], extended to soft-core and continuum systems by Faris in [5] and by Jansen in [14], is sufficient for absolute convergence of the activity expansions \(\varvec{\rho }\). We prove the result in the slightly more general setup of locally stable interactions.

Corollary 2.3

Let z be a non-negative activity function and assume locally stable interactions in the sense of (2.2) for some \(C\ge 0\). If there exists a measurable function \(\mu : \mathbb {X}\rightarrow [0,\infty )\) such that for all \(x\in \mathbb X\)

$$\begin{aligned} z(x)\left( 1+\sum \limits _{k=1}^{\infty }\frac{1}{k!}\int _{\mathbb {X}^{k}}\mathrm {e}^{\sum _{j=1}^k C(y_j)}\prod \limits _{j=1}^{k}\left| f(x,y_j)\right| \prod \limits _{1\le i<j\le k}(1+f(y_i,y_j)) \lambda _\mu ^{k}(\mathrm {d}\varvec{y})\right) \le \mu (x), \end{aligned}$$
(2.9)

then the activity expansions \(\rho _n(x_1,\ldots , x_n;z)\) converge absolutely and the bounds

$$\begin{aligned} \rho _n(x_1,\ldots , x_n;z)\le \prod \limits _{1\le i<j\le n}\left( 1+f(x_i,x_j)\right) \prod \limits _{i=1}^n\mu (x_i) \end{aligned}$$

hold for all \(n\in \mathbb {N}\) and \((x_1,\ldots , x_n)\in \mathbb X^n\). Notice that for non-negative pair interactions, we can choose \(C\equiv 0\) in condition (2.9).

Remark 2.3

The Fernández–Procacci condition improves on the Kotecký–Preiss condition, in the sense that the assumptions of Corollary 2.2 imply the assumptions of Corollary 2.3. In other words, Corollary 2.3 in general guarantees convergence of \(\varvec{\rho }\) on a larger domain of activities.

Proof

Assume that (2.9) holds and define \(\varvec{\xi }=(\xi _n)_{n\in \mathbb {N}},\ \xi _n:\mathbb {X}^n\rightarrow [0, \infty ),\) by

$$\begin{aligned} \xi _n(x_1,\ldots ,x_n):=\prod \limits _{1\le i<j\le n}\left( 1+f(x_i,x_j)\right) \prod \limits _{i=1}^n\mu (x_i) \end{aligned}$$

for some \(\mu \) satisfying (2.9). Let s be the selection rule that always selects the first entry—so that \(x_s = x_1\) and \(x'_i = x_i\) for \(i\ge 2\). For locally stable pair potentials, we have

$$\begin{aligned}&\prod \limits _{i=2}^n(1+f(x_1,x_i))\xi _{n+k-1}(x_2,\ldots ,x_n,y_1,\ldots ,y_k) \nonumber \\&\quad = \prod \limits _{1\le i<j\le n}(1+f(x_i,x_j))\prod \limits _{1\le i<j\le k}(1+f(y_i,y_j))\prod \limits _{i=2}^n\prod \limits _{j=1}^k(1+f(x_i,y_j))\prod \limits _{i=2}^n\mu (x_i)\prod \limits _{j=1}^k\mu (y_j)\nonumber \\&\quad \le \Bigl ( \prod \limits _{1\le i<j\le n}(1+f(x_i,x_j)) \prod \limits _{i=2}^n\mu (x_i)\Bigr ) \Bigl ( \prod \limits _{1\le i<j\le k}(1+f(y_i,y_j)) \prod _{j=1}^k \mu (y_j)\Bigr )\mathrm {e}^{\sum _{j=1}^k C(y_j)}, \end{aligned}$$
(2.10)

where we used the local stability to estimate

$$\begin{aligned} \prod \limits _{i=2}^n\prod \limits _{j=1}^k(1+f(x_i,y_j))=\prod \limits _{j=1}^k\prod \limits _{i=2}^n(1+f(x_i,y_j))\le \prod \limits _{j=1}^k\mathrm {e}^{C(y_j)}=\mathrm {e}^{\sum _{j=1}^k C(y_j)}. \end{aligned}$$

We plug our choice of \(\varvec{\xi }\) into the left-hand side of (2.5) and use the estimate (2.10) together with the assumption (2.9) to find that condition (i) of Theorem 2.1 is satisfied. \(\square \)

Remark 2.4

We see that Theorem 2.1 provides a mechanism to prove sufficient conditions for absolute convergence: by constructing a family of ansatz functions \(\varvec{\xi }\) tailored to satisfy the Kirkwood–Salsburg inequalities under the given condition. Conversely, given an appropriate family of ansatz functions \(\varvec{\xi }\), obtained, for example, as an approximation of \(\varvec{\rho }\), one can try to determine the corresponding sufficient condition for convergence.

We now proceed to demonstrate the usefulness of that approach by deriving a sufficient condition that improves on the classical examples above.

Abstract Polymer Models

In the following we want to consider the setup of abstract polymers [1, 8], in which the two classical conditions above—Kotecký–Preiss and Fernández–Procacci—were first introduced.

Let \(\mathbb X\) be a countable set (the set of polymers), let \({\mathcal {X}}\) be the power set of \(\mathbb X\) and let \(\lambda \) be the counting measure. Moreover, let \(R\subset \mathbb {X}\times \mathbb {X}\) be a symmetric and reflexive relation. We write \(x\not \sim y\) for \((x,y)\in R\) (and say that x and y are incompatible) and \(x\sim y\) for \((x,y)\notin R\) (and say that x and y are compatible). Moreover, we call a subset \(X\subset \mathbb X\) compatible if \(x\sim y\) for all \(x\ne y \in X\) and write \(X\sim z\) for \(z\in \mathbb {X}\) if \(z\sim x\) for all \(x\in X\). We set \(\Gamma {(x)}:=\{y\in \mathbb X\vert \ y\not \sim x\}\) for any \(x\in \mathbb {X}\) and extend this notation to \(\Gamma {(X)}:=\cup _{x\in X}\{y\in \mathbb X\vert \ y\not \sim x\}\) for any \(X\subset \mathbb {X}\). Notice that we do not require the sets \(\Gamma (x)\) to be finite and that \(x\in \Gamma (x)\) for every \(x\in \mathbb {X}\). Finally, we consider hard-core interactions corresponding to Mayer’s f function given by \(f(x,y):= -\mathbb {1}_{\{x\not \sim y\}}\).

In this setting we prove a new, improved sufficient condition for absolute convergence of the activity expansions \(\varvec{\rho }\).

Proposition 2.4

Let z be a non-negative activity function and assume that there exists \(\mu :\mathbb {X}\rightarrow [0,\infty )\) such that for all \(x\in \mathbb {X}\)

$$\begin{aligned} z(x)\left( 1+\sum \limits _{k\ge 1}\sum \limits _{{\begin{array}{c} Y=\{y_1,...,y_k\}\\ y_i \not \sim x,\ y_i\sim y_j \end{array}}}\prod \limits _{i=1}^{k}\mu (y_i)\prod \limits _{w\in \Gamma (Y)}\mathrm {e}^{\mu (w)}\right) \le \mu (x)\prod \limits _{w\in \Gamma (x)}\mathrm {e}^{\mu (w)}, \end{aligned}$$
(2.11)

where the inner sum on the left-hand side runs over compatible subsets \(Y=\{y_1,...,y_k\}\subset \Gamma (x)\). Then the activity expansions \(\rho _n(x_1,\ldots , x_n;z)\) converge absolutely and the bounds

$$\begin{aligned} \rho _n(x_1,\ldots , x_n;z)\le \prod \limits _{1\le i<j\le n}\mathbb {1}_{\{x_i\sim x_j\}}\prod \limits _{i=1}^n\mu (x_i)\prod \limits _{w\in \Gamma (\{x_1,\ldots ,x_n\})}\mathrm {e}^{\mu (w)} \end{aligned}$$

hold for all \(n\in \mathbb {N}\) and all \((x_1,\ldots , x_n)\in \mathbb X^n\).

The proof of the proposition essentially exploits the following auxiliary result:

Lemma 2.5

Let \(\mu : \mathbb {X}\rightarrow [0,\infty )\). Then the following holds for every \(x_1\in \mathbb {X}\), \(n\in \mathbb {N}\) and \(X=\{x_2,...,x_n\}\subset \mathbb {X}\) such that \(x_1\sim x_i\) for all \(i\in \{2,\ldots ,n\}\):

$$\begin{aligned}&\frac{\mu (x_1)\prod \limits _{w\in \Gamma (x_1)}\mathrm {e}^{\mu (w)}}{1+\sum \limits _{k\ge 1}\sum \limits _{{\begin{array}{c} Y=\{y_1,...,y_k\}\\ y_i \not \sim x_1,\ y_i\sim y_j \end{array}}}\prod \limits _{i=1}^k\mu (y_i)\prod \limits _{w\in \Gamma (Y)}\mathrm {e}^{\mu (w)}}\nonumber \\&\quad \le \frac{\mu (x_1)\prod \limits _{w\in \Gamma (x_1)\cap \Gamma (X)^C}\mathrm {e}^{\mu (w)}}{1+\sum \limits _{k\ge 1}\sum \limits _{\begin{array}{c} \begin{array}{c} Y=\{y_1,...,y_k\}\\ y_i \not \sim x_1,\ y_i\sim y_j \end{array}\\ y_i\sim X \end{array}}\prod \limits _{i=1}^k\mu (y_i)\prod \limits _{w\in \Gamma (Y)\cap \Gamma (X)^C}\mathrm {e}^{\mu (w)}}, \end{aligned}$$
(2.12)

where \(\Gamma (W)\) is given by \(\cup _{i=1}^{n}\Gamma (w_i)\) for any \(n\in \mathbb {N}\) and \(W=\{w_1,...,w_n\}\subset \mathbb {X}\). The inner sum in the denominator on the left-hand side runs over compatible subsets \(Y=\{y_1,\ldots ,y_k\}\subset \Gamma (x_1)\); the inner sum in the denominator on the right-hand side runs over all such subsets Y which additionally satisfy the constraint \(Y\cap \Gamma (X)=\varnothing \), i.e., \(y_i\sim X\) for all \(i\in \{1,\ldots ,k\}.\)

The lemma is of a rather technical nature; for the interested reader, the proof can be found in Appendix A.
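Although the proof is deferred to the appendix, inequality (2.12) is easy to probe numerically. The following script is a sanity check of our own on a small hand-picked polymer system with hypothetical incompatibility edges: it enumerates the compatible subsets Y appearing on both sides and verifies the inequality for several choices of \(\mu \).

```python
from itertools import combinations
import math

polymers = range(6)
# Hypothetical incompatibility edges; the relation is reflexive by definition.
edges = {(0, 2), (0, 3), (1, 3), (1, 4)}

def incompatible(a, b):
    return a == b or (a, b) in edges or (b, a) in edges

def gamma(S):
    """Gamma(S): all polymers incompatible with some element of S."""
    return {y for x in S for y in polymers if incompatible(x, y)}

def compatible_subsets(pool):
    """All pairwise-compatible subsets of pool (the empty set included)."""
    for r in range(len(pool) + 1):
        for Y in combinations(sorted(pool), r):
            if all(not incompatible(a, b) for a, b in combinations(Y, 2)):
                yield set(Y)

def side(mu, x1, X, restricted):
    """Both sides of (2.12) share one shape: restricted=False gives the
    left-hand side, restricted=True the right-hand side."""
    keep = set(polymers) - gamma(X) if restricted else set(polymers)
    num = mu[x1] * math.exp(sum(mu[w] for w in gamma({x1}) & keep))
    den = 0.0
    for Y in compatible_subsets(gamma({x1})):
        if restricted and gamma(X) & Y:   # enforce y_i ~ X
            continue
        den += math.prod(mu[y] for y in Y) * \
            math.exp(sum(mu[w] for w in gamma(Y) & keep))
    return num / den

x1, X = 0, {1}                       # 0 ~ 1, as the lemma requires
trials = [[t] * 6 for t in (0.1, 0.3, 0.7, 1.5)]
trials.append([0.2, 0.5, 0.1, 0.8, 0.3, 0.4])
checks = [(side(mu, x1, X, False), side(mu, x1, X, True)) for mu in trials]
print(all(lhs <= rhs + 1e-12 for lhs, rhs in checks))  # True
```

The empty set contributes the leading 1 in both denominators, and dropping the option restricted=True recovers the unconstrained sum on the left-hand side.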

Remark 2.5

The general idea behind the proof of Proposition 2.4 is to argue as in the proofs of the classical conditions presented in the previous section (Corollaries 2.2 and 2.3), but to choose a family of ansatz functions \( \varvec{\xi }\) which, heuristically speaking, encodes more of the structure of the exact solution to the Kirkwood–Salsburg equations (i.e., of the activity expansions \(\varvec{\rho }\)) than the ansatz functions chosen in the proofs of those corollaries. The intuition is that “less multiplicative” ansatz functions \(\varvec{\xi }\) provide better convergence criteria.

Proof of Proposition 2.4

Assume that (2.11) holds and define \(\varvec{\xi }=(\xi _n)_{n\in \mathbb {N}},\ \xi _n:\mathbb {X}^n\rightarrow [0, \infty ),\) by setting

$$\begin{aligned} \xi _n(x_1,...,x_n):=\prod \limits _{1\le i<j\le n}\mathbb {1}_{\{x_i\sim x_j\}}\prod \limits _{i=1}^n\mu (x_i)\prod \limits _{w\in \Gamma (\{x_1,\ldots ,x_n\})}\mathrm {e}^{\mu (w)} \end{aligned}$$
(2.13)

for some \(\mu \) satisfying (2.11), for any \(n\in \mathbb {N}\) and every \((x_1,...,x_n)\in \mathbb {X}^n\). Thereby we again use the convention \(\Gamma (\{w_1,\ldots ,w_n\})=\cup _{i=1}^n\Gamma (w_i)\) for \(\{w_1,...,w_n\}\subset \mathbb X\). As in the preceding proofs of the classical sufficient conditions, we show that our choice of \(\varvec{\xi }=(\xi _n)_{n\in \mathbb {N}}\) satisfies the system of Kirkwood–Salsburg inequalities (2.5) from Theorem 2.1. To lighten the notation, we choose the same selection rule s as in the proof of Corollary 2.3 and denote by X the set \(\{x_2,...,x_n\}\). Notice that the left-hand side of (2.5) is equal to

$$\begin{aligned}&z(x_1)\prod \limits _{1\le i<j\le n}\mathbb {1}_{\{x_i\sim x_j\}}\prod \limits _{i=2}^n\mu (x_i)\prod \limits _{w\in \Gamma (X)}\mathrm {e}^{\mu (w)}\\&\quad \times \left( 1+\sum \limits _{k\ge 1}\sum \limits _{Y=\{y_1,...,y_k\}}\prod \limits _{j=1}^{k}\mathbb {1}_{\{y_j\not \sim x_1\}}\prod \limits _{\begin{array}{c} 2\le i \le n\\ 1\le j\le k \end{array}}\mathbb {1}_{\{x_i\sim y_j\}}\prod \limits _{1\le i< j\le k}\mathbb {1}_{\{y_i\sim y_j\}}\right. \\&\quad \left. \prod \limits _{j=1}^k\mu (y_j)\prod \limits _{w\in \Gamma (Y)\cap \Gamma (X)^C}\mathrm {e}^{\mu (w)}\right) . \end{aligned}$$

By Lemma 2.5, the assumption that z satisfies the condition (2.11) implies that z also satisfies the inequality

$$\begin{aligned}&z(x_1)\left( 1+\sum \limits _{k\ge 1}\sum \limits _{Y=\{y_1,...,y_k\}}\prod \limits _{j=1}^{k}\mathbb {1}_{\{y_j\not \sim x_1\}}\prod \limits _{\begin{array}{c} 2\le i \le n\\ 1\le j\le k \end{array}}\mathbb {1}_{\{x_i\sim y_j\}}\prod \limits _{1\le i< j\le k}\mathbb {1}_{\{y_i\sim y_j\}}\right. \\&\quad \left. \prod \limits _{j=1}^k\mu (y_j)\prod \limits _{w\in \Gamma (Y)\cap \Gamma (X)^C}\mathrm {e}^{\mu (w)}\right) \\&\quad \le \mu (x_1)\prod \limits _{w\in \Gamma (x_1)\cap \Gamma (X)^C}\mathrm {e}^{\mu (w)} \end{aligned}$$

and thus, for our choice of \(\varvec{\xi }\), the left-hand side of (2.5) is bounded from above by

$$\begin{aligned}&\prod \limits _{1\le i<j\le n}\mathbb {1}_{\{x_i\sim x_j\}}\prod \limits _{i=1}^n \mu (x_i)\prod \limits _{w\in \Gamma (X)}\mathrm {e}^{\mu (w)}\prod \limits _{w\in \Gamma (x_1)\cap \Gamma (X)^C}\mathrm {e}^{\mu (w)}\\&\quad \quad =\prod \limits _{1\le i<j\le n}\mathbb {1}_{\{x_i\sim x_j\}}\prod \limits _{i=1}^n\mu (x_i)\prod \limits _{w\in \Gamma (X\cup \{x_1\})}\mathrm {e}^{\mu (w)}=\xi _{n}(x_1,...,x_n), \end{aligned}$$

which—by Theorem 2.1—yields the claim of the proposition.\(\square \)

Example 2.1

Consider non-overlapping cubes of side-length 2 on \(\mathbb {Z}^2\) (hard-core interactions) with translationally invariant activity z. The sufficient condition on z for the absolute convergence of \(\rho (z)\) given by the Fernández–Procacci criterion provides the bound

$$\begin{aligned} z\le \max \limits _{\mu \ge 0}\ \ \frac{\mu }{1+9\mu +16\mu ^2+8\mu ^3+\mu ^4}\approx 0.057271, \end{aligned}$$

while our condition from Proposition 2.4 provides

$$\begin{aligned} z\le \max \limits _{\mu \ge 0}\ \ \frac{\mu e^{9\mu }}{1+9e^{9\mu }\mu +(6e^{15\mu }+8e^{16\mu }+2e^{17\mu })\mu ^2+8e^{21\mu }\mu ^3+e^{25\mu }\mu ^4}\approx 0.060833. \end{aligned}$$

This corresponds to an improvement of approximately 6%.
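Both maxima can be checked numerically. The sketch below (our illustration; a plain grid search using only the standard library) reproduces the two bounds:

```python
import math

def fp_bound(mu):
    # Fernández–Procacci-type bound for 2x2 cubes on Z^2
    return mu / (1 + 9*mu + 16*mu**2 + 8*mu**3 + mu**4)

def improved_bound(mu):
    # bound from Proposition 2.4 for the same system
    e = math.exp
    den = (1 + 9*e(9*mu)*mu
           + (6*e(15*mu) + 8*e(16*mu) + 2*e(17*mu))*mu**2
           + 8*e(21*mu)*mu**3 + e(25*mu)*mu**4)
    return mu * e(9*mu) / den

def maximize(f, lo=0.0, hi=1.0, steps=100_000):
    # crude grid search; adequate here because f is flat near its maximum
    return max(f(lo + (hi - lo)*k/steps) for k in range(1, steps + 1))

print(maximize(fp_bound))        # approx. 0.057271
print(maximize(improved_bound))  # approx. 0.060833
```

Both objective functions vanish at \(\mu =0\) and decay for large \(\mu \), so the search window \([0,1]\) suffices.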

Hard-Core Systems in the Continuum

Let \({\mathscr {K}}'\) be the collection of non-empty compact subsets of \(\mathbb {R}^d\), equipped with the Hausdorff distance and Borel \(\sigma \)-algebra [19, Chapter I-4], and \(\mathbb X\subset {\mathscr {K}}'\) a non-empty measurable subset. Here we additionally assume that \(\mathbb X\) consists of bounded convex sets that are non-empty and regular closed, i.e., equal to the closure of their non-empty interior. Notice that such sets are compact and have finite positive Lebesgue measure, equal to the Lebesgue measure of their interior. In practice \(\mathbb X\) will consist of easily described subsets. For example, when dealing with closed balls \(B_r(x)\subset \mathbb {R}^d\), we may identify \(\mathbb X\) with \(\mathbb {R}^d\times \mathbb {R}_+\). Consider the hard-core interaction given by the potential \(v(X,Y) := \infty \mathbb {1}_{\{X\cap Y \ne \varnothing \}}\); the corresponding Mayer f-function is

$$\begin{aligned} f(X,Y) = - \mathbb {1}_{\{X\cap Y\ne \varnothing \}}. \end{aligned}$$

Clearly the function is well-defined for general subsets \(X,Y\subset \mathbb {R}^d\) that are not necessarily in \(\mathbb X\); the domains of definition of the functions \(\varphi _n^{\mathsf {T}}\) and \(\psi _{n,n+k}\) extend accordingly.

For \(D\subset \mathbb {R}^d\) and a measure \(\lambda _z\) on \({\mathcal {X}}\) defined as in (2.3), consider the formal series

$$\begin{aligned} T(D;z) := 1+\sum _{k=1}^\infty \frac{1}{k!}\int _{\mathbb X^k} \varphi _{1+k}^{\mathsf {T}}(D,Y_1,\ldots , Y_k) \lambda _z^k(\mathrm {d}\varvec{Y}). \end{aligned}$$
(2.14)

As is well known [6, Eq. (3.12)],

$$\begin{aligned} T(D;z) = \exp \Biggl ( - \sum _{k=1}^\infty \frac{1}{k!}\int _{\mathbb X^k} \mathbb {1}_{\{\exists i:\, Y_i \cap D\ne \varnothing \}} \varphi _{k}^{\mathsf {T}}(Y_1,\ldots , Y_k) \lambda _z^k(\mathrm {d}\varvec{Y}) \Biggr ) \end{aligned}$$
(2.15)

on the level of formal power series.

Moreover, if the domain D can be written as a finite union of disjoint objects \(X_i\in \mathbb {X}\), say \(D=X_1\cup \ldots \cup X_n\) for \(n\in \mathbb {N}\), then the identity

$$\begin{aligned}&1+\sum _{k=1}^\infty \frac{1}{k!}\int _{\mathbb X^k} \varphi _{1+k}^{\mathsf {T}}(D,Y_1,\ldots , Y_k) \lambda _z^k(\mathrm {d}\varvec{Y})\\&\quad = \sum _{k=0}^\infty \frac{1}{k!} \int _{\mathbb X^k} \psi _{n,n+k}(X_1,\ldots ,X_n,Y_1,\ldots ,Y_k)\lambda _z^k(\mathrm {d}\varvec{Y}) \end{aligned}$$

holds by Lemma 3.8 below and we recognize that the series \(T(D;z)\) provide expansions for the reduced correlation functions in the sense that

$$\begin{aligned} \rho _n(X_1,\ldots ,X_n;z) = z(X_1)\cdots z(X_n) \mathbb {1}_{\{X_1,\ldots ,X_n\ \text {disjoint}\}}\, T(X_1\cup \cdots \cup X_n;z). \end{aligned}$$

The absolute convergence of the expansions \(\rho (z)\) for the correlation functions is implied by the absolute convergence of \(T(D;z)\), i.e., by the pointwise convergence

$$\begin{aligned} 1+\sum _{k=1}^\infty \frac{1}{k!}\int _{\mathbb X^k} \vert \varphi _{1+k}^{\mathsf {T}}(D,Y_1,\ldots , Y_k)\vert \lambda _z^k(\mathrm {d}\varvec{Y})<\infty , \end{aligned}$$

for all domains D that are unions of finitely many objects \(X_i \in \mathbb X\).

Assume we are given a systematic way to chop up the objects \(X\in \mathbb X\) into smaller bits and pieces, called snippets (think of representing a polymer as a collection of monomers, as in the discrete setup of subset polymers). That is, choose some \(\varepsilon >0\) and assume that there is a designated collection \(\mathbb E_\varepsilon \) of bounded Borel sets in \(\mathbb {R}^d\), each of which is contained in some open ball of radius \(\varepsilon \), and a chopping map

$$\begin{aligned} C:\mathbb X\rightarrow {\mathcal {P}}(\mathbb E_\varepsilon ),\quad X\mapsto C(X) \end{aligned}$$

such that for every \(X\in \mathbb X\), \(C(X) = \{E_1,\ldots , E_m\}\) with \(m\in \mathbb {N}\) and \(E_1,\ldots , E_m\) a set partition of X. We additionally want to assume that the topological boundary of every snippet is a \(\lambda \)-null set, i.e., \(\lambda (\overline{E}\backslash E^\circ )=0\) for all \(E\in \mathbb E_\varepsilon \) (where \(\overline{E}\) denotes the topological closure and \(E^\circ \) the interior of E).
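As a toy illustration of a chopping map (our own construction, restricted to axis-aligned rectangles with corners on the \(\varepsilon \)-grid), one can take the snippets to be half-open grid cells of side \(\varepsilon \): such a cell has circumradius \(\varepsilon /\sqrt{2}<\varepsilon \), hence fits in an open ball of radius \(\varepsilon \), and its boundary is a Lebesgue null set.

```python
def chop(rect, eps):
    # chop an axis-aligned rectangle into half-open eps-grid cells;
    # rect = (x0, y0, x1, y1) with all corners on the eps-grid
    x0, y0, x1, y1 = rect
    nx, ny = round((x1 - x0) / eps), round((y1 - y0) / eps)
    return [(x0 + i*eps, y0 + j*eps) for i in range(nx) for j in range(ny)]

# a 2x1 rectangle with eps = 1/2 yields a 4x2 grid of snippets
snippets = chop((0.0, 0.0, 2.0, 1.0), 0.5)
assert len(snippets) == 8
assert abs(len(snippets) * 0.5**2 - 2.0) < 1e-12  # areas add up to |rect|
```

For general convex bodies the cells would be intersected with the body; the rectangle case keeps the partition property trivially exact.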

Let \(\mathbb D_\varepsilon \) be the set of bounded domains \(D\subset \mathbb {R}^d\) that can be written as the union of finitely many disjoint snippets. The empty set \(D= \varnothing \) is an element of \(\mathbb D_\varepsilon \). For two disjoint subsets \(D_0,D_1\subset \mathbb {R}^d\) with \(D_0\ne \varnothing \) and for finitely many objects \(Y_1,\ldots ,Y_k\in \mathbb {X}\), \(k\in \mathbb N\), set

$$\begin{aligned} I(D_0; D_1; Y_1,\ldots , Y_k):=\Bigl ( \prod _{i=1}^k \mathbb {1}_{\{D_0\cap Y_i \ne \varnothing ,\, D_1\cap Y_i = \varnothing \}}\Bigr )\Bigl ( \prod _{1\le i<j\le k} \mathbb {1}_{\{Y_i\cap Y_j = \varnothing \}}\Bigr ). \end{aligned}$$
(2.16)

Theorem 2.6

Let \(z(\cdot )\) be a non-negative activity function. The following two conditions are equivalent:

  1. (i)

    There exists a non-negative map \(a:\mathbb D_\varepsilon \rightarrow \mathbb {R}_+\) such that for all \(D\in \mathbb D_\varepsilon \), the map \({\mathscr {K}}'\ni F\mapsto a(D\cup F)\) is measurable and the following system of inequalities is satisfied: For all non-empty \(D\in \mathbb D_\varepsilon \) with \(C(D)=\{E_1,\ldots ,E_m\}\subset \mathbb {E}_\varepsilon \) for some \(m\in \mathbb {N}\), there exists an \(s\in \{1,\ldots ,m\}\) such that—setting \(D':=D\backslash E_s\)—we have

    $$\begin{aligned} \sum _{k=1}^\infty \frac{1}{k!}\int _{\mathbb X^k} I(E_s; D'; Y_1,\ldots , Y_k) \mathrm {e}^{a(D'\cup Y_1\cup \cdots \cup Y_k) - a(D')} \lambda _z^k(\mathrm {d}\varvec{Y}) \le \mathrm {e}^{a(E_s\cup D') - a(D')}-1. \end{aligned}$$
  2. (ii)

    \(T(D;z)\) is absolutely convergent for all \(D\in \mathbb D_\varepsilon \).

Moreover, if one of the equivalent conditions (hence, both) holds true, then, for all \(D\in \mathbb D_\varepsilon \), we have

$$\begin{aligned} \bigl | \log T(D;z)\bigr |\le \sum _{k=1}^\infty \frac{1}{k!}\int _{\mathbb X^k} \mathbb {1}_{\{\exists i:\, Y_i \cap D\ne \varnothing \}} \bigl | \varphi _{k}^{\mathsf {T}}(Y_1,\ldots , Y_k) \bigr | \lambda _z^k(\mathrm {d}\varvec{Y}) \le a(D). \end{aligned}$$
(2.17)

Subset Polymers

Let \(\mathbb X\) consist of the finite non-empty subsets of \(\mathbb {Z}^d\) (or any other countable set), and let \({\mathcal {X}} = \mathcal P(\mathbb X)\) be the \(\sigma \)-algebra containing all subsets of \(\mathbb X\). The reference measure \(\lambda \) is simply the counting measure. The interaction is a pure hard-core interaction as in Sect. 2.3. Notice that this setup is a special case of the abstract polymer setup introduced in Sect. 2.2. For a finite set \(D\subset \mathbb {Z}^d\), define \(T(D;z)\) as in (2.14). In statistical physics \(T(D;z)\) corresponds to the probability that no polymer intersects D. If D is a polymer or a union of disjoint polymers, it corresponds to a reduced correlation function in the sense of [12].
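The probabilistic interpretation can be verified by brute force on a tiny hypothetical instance: the sketch below enumerates all families of pairwise disjoint polymers on a three-point lattice and computes the probability that no polymer intersects D as the ratio of the partition function restricted to polymers avoiding D to the full partition function.

```python
from itertools import chain, combinations

# toy lattice and polymer set: all non-empty subsets of {0, 1, 2}
sites = (0, 1, 2)
polymers = [frozenset(s) for r in range(1, 4)
            for s in combinations(sites, r)]

def activity(Y, z=0.1):
    return z ** len(Y)   # translationally invariant choice z(Y) = z^{|Y|}

def partition_function(allowed):
    # sum over all families of pairwise disjoint polymers from `allowed`
    total = 0.0
    for fam in chain.from_iterable(combinations(allowed, k)
                                   for k in range(len(allowed) + 1)):
        if all(not (A & B) for A, B in combinations(fam, 2)):
            w = 1.0
            for Y in fam:
                w *= activity(Y)
            total += w
    return total

def T(D):
    # probability that no polymer intersects D
    allowed = [Y for Y in polymers if not (Y & D)]
    return partition_function(allowed) / partition_function(polymers)

print(T(frozenset({0})))  # a probability in (0, 1)
```

For \(D=\varnothing \) the ratio is 1, and enlarging D can only decrease it, consistent with the interpretation as a probability.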

Notice that in the case of subset polymers every polymer can be chopped in a canonical way: into the disjoint collection of its monomers. These play the role of the snippets from the previous section, which simplifies the formulation of a criterion for absolute convergence of the activity expansions \(\varvec{\rho }\) (compare the next result with Theorem 2.6).

Theorem 2.7

Let \((z(X))_{X\in \mathbb X}\) be a non-negative activity. The following two conditions are equivalent:

  1. (i)

    There exists a function \(a(\cdot )\) from the finite subsets of \(\mathbb {Z}^d\) to \([0,\infty )\) such that \(a(\varnothing )=0\) and the following system of inequalities is satisfied: For all finite, non-empty subsets \(D\subset \mathbb {Z}^d\) there exists an \(x\in D\) such that—setting \(D':=D\backslash \{x\}\)—we have

    $$\begin{aligned} \sum _{\begin{array}{c} Y\in \mathbb X:\\ Y\ni x,\, Y\cap D' =\varnothing \end{array}} z(Y) \mathrm {e}^{a(D'\cup Y) - a(D')} \le \mathrm {e}^{a(D'\cup \{x\}) - a(D')}-1. \end{aligned}$$
    (2.18)
  2. (ii)

    \(T(D;z)\) is absolutely convergent for all finite subsets \(D\subset \mathbb {Z}^d\).

Moreover, if one of the equivalent conditions (hence, both) holds true, then, for all finite subsets \(D\subset \mathbb {Z}^d\), we have

$$\begin{aligned} \bigl | \log T(D;z)\bigr |\le \sum _{k=1}^\infty \frac{1}{k!}\sum _{(Y_1,\ldots , Y_k) \in \mathbb X^k} \mathbb {1}_{\{\exists i:\, Y_i \cap D\ne \varnothing \}} \bigl | \varphi _{k}^\mathsf T(Y_1,\ldots , Y_k) \bigr | z(Y_1)\cdots z(Y_k) \le a(D). \end{aligned}$$
(2.19)

The theorem is similar to Claim 1 in [1, Sect. 4.2]. As noted in [1], Theorem 2.7 allows for an easy recovery of the extended Gruber–Kunz criterion. The criterion is named after Gruber and Kunz [12], who proved a similar condition but with a strict inequality. See [8] for a comparison of the Gruber–Kunz criterion to other classical conditions.

Corollary 2.8

Let \((z(X))_{X\in \mathbb X}\) be a non-negative activity. Suppose there exists some \(\alpha \ge 0\) such that for all \(x\in \mathbb {Z}^d\),

$$\begin{aligned} \sum _{Y\ni x}z(Y)\,\mathrm {e}^{\alpha |Y|}\le \mathrm {e}^{\alpha }-1. \end{aligned}$$
(2.20)

Then \(T(D;z)\) is absolutely convergent for all finite subsets \(D\subset \mathbb {Z}^d\).

Proof

Set \(a(D):=\alpha |D|\), where \(\alpha \ge 0\) satisfies the inequality from (2.20). Because of the additivity of \(a(\cdot )\), we have \(a(D\cup Y) = a(D) + a(Y)\) for all finite, disjoint subsets \(D,Y\subset \mathbb {Z}^d\). Therefore condition (2.18) becomes

$$\begin{aligned} \sum _{\begin{array}{c} Y\ni x:\\ Y\cap D =\varnothing \end{array}}z(Y)\, \mathrm {e}^{a(Y)} \le \mathrm {e}^{a(\{x\})}-1, \end{aligned}$$

which depends on D only through the constraint \(Y\cap D=\varnothing \) on the left-hand side. By the non-negativity of the activity z, it is clearly sufficient that

$$\begin{aligned} \sum _{Y\ni x}z(Y)\, \mathrm {e}^{a(Y)} \le \mathrm {e}^{a(\{x\})}-1, \end{aligned}$$

which holds true for all \(x\in \mathbb {Z}^d\) because of (2.20). \(\square \)
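For a concrete toy computation (ours, not from the text), take 2×2 squares on \(\mathbb {Z}^2\) with constant activity z, as in Example 2.1. Each site x lies in exactly four such squares, each of size \(|Y|=4\), so (2.20) reads \(4z\,\mathrm {e}^{4\alpha }\le \mathrm {e}^{\alpha }-1\); optimizing over \(\alpha \) gives \(\alpha =\log (4/3)\) and the bound \(z\le 27/1024\approx 0.0264\), noticeably weaker than the bounds obtained in Example 2.1:

```python
import math

def gk_rhs(alpha, count=4, size=4):
    # largest z allowed by (2.20): count * z * e^(size*alpha) <= e^alpha - 1
    return (math.exp(alpha) - 1) / (count * math.exp(size * alpha))

# optimize over alpha > 0 by grid search
best = max(gk_rhs(a / 10000) for a in range(1, 20000))
print(best)  # approx. 27/1024 = 0.0263671875
```
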

Another immediate consequence of Theorem 2.7 is that convergence of cluster expansions implies exponential decay of the activities in the object size. Precisely, set

$$\begin{aligned} V(D):= \sum _{\begin{array}{c} Y\in \mathbb X:\\ Y \cap D \ne \varnothing \end{array}} z(Y). \end{aligned}$$

Notice that if the activity is translationally invariant and not identically zero, one can choose an arbitrary polymer \(X\in \mathbb {X}\) with positive activity, say \(z_0> 0\), and obtain the bound

$$\begin{aligned} V(D) \ge z_0 |D|. \end{aligned}$$
(2.21)

Theorem 2.9

If \(z(\cdot )\) is a non-negative activity and \(T(D;z)\) is absolutely convergent for all finite subsets \(D\subset \mathbb {Z}^d\), then necessarily

$$\begin{aligned} \sum _{Y \ni x} z(Y) \mathrm {e}^{V(Y)} <\infty \end{aligned}$$

for all \(x\in \mathbb {Z}^d\).

Proof

By condition (i) in Theorem 2.7, evaluated at \(D'=\varnothing \), there exists a non-negative function \(a(\cdot )\) such that

$$\begin{aligned} \sum _{Y \ni x} z(Y) \mathrm {e}^{a(Y)} \le \mathrm {e}^{a(\{x\})} - 1< \infty . \end{aligned}$$

For any polymer \(Y\in \mathbb {X}\), the value a(Y) is at least as large as V(Y) by (2.19) and the claim follows. \(\square \)

For translationally invariant systems, Theorem 2.9 says that if the activity expansions are absolutely convergent, then necessarily the activities are exponentially small in the size of the object: by (2.21) we can observe that \(z(X) = O(\exp ( - {z_0} |X|))\) as \(|X|\rightarrow \infty \). Let us emphasize that the necessary exponential decay is an intrinsic limitation of the activity expansion, which cannot be eliminated by tinkering with different sufficient convergence conditions. Rigorous results for one-dimensional and hierarchical models [13, 15] suggest that the exponential decay is not needed for the convergence of the multi-species virial expansion; for general systems, however, this is so far an unproven conjecture.
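The counting behind (2.21) can be made explicit: for every site \(y\in D\), translate a fixed polymer X with \(z(X)=z_0>0\) so that a designated point of X lands on y; this produces \(|D|\) distinct translates, each intersecting D. A small sketch with a hypothetical shape and domain:

```python
def translates_hitting(shape, D):
    # all translates t + shape (t in Z^2) that intersect the finite set D
    hits = set()
    for (dx, dy) in D:
        for (sx, sy) in shape:
            t = (dx - sx, dy - sy)  # moves the point (sx, sy) onto (dx, dy)
            hits.add(t)             # t + shape contains (dx, dy), so it hits D
    return hits

shape = {(0, 0), (1, 0), (0, 1), (1, 1)}           # a 2x2 square
D = {(x, y) for x in range(5) for y in range(3)}   # a 5x3 box
assert len(translates_hitting(shape, D)) >= len(D)  # hence V(D) >= z0 * |D|
```

Fixing the designated point \((0,0)\) already gives \(|D|\) pairwise distinct translates, which is the inequality used for (2.21).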

Combinatorial Lemmas: Proofs of Theorems 2.1, 2.6, and 2.7

Forest Partition Schemes: Alternating Sign Property

To obtain a better understanding of the series \(\rho _n\) given by the generating functions of multi-rooted graphs, we now consider a particular way to construct the latter—by taking a designated spanning forest and successively adding edges to it. This perspective on multi-rooted graphs leads to a forest-graph equality analogous to the familiar tree-graph identity for connected graphs [8, Proposition 5] and allows for a direct proof of an alternating sign property for the coefficients \(\psi _{n,n+k}\) of \(\rho _n\) in the case of repulsive interactions (i.e., for non-negative potentials).

The forest-graph equality builds on the notion of forest partition schemes—maps that assign spanning forests to multi-rooted graphs in \({\mathcal {D}}_{n,n+k}\) and thereby in a specific manner provide partitions of \({\mathcal {D}}_{n,n+k}\).

In the following, we let \({\mathcal {F}}_{n,n+k}\) denote the set of forest graphs on the vertex set \([n+k]\) consisting of n rooted trees, where the vertices \(\{1,\ldots ,n\}\) are the roots of the trees (recall that a forest is an acyclic graph and a tree is a connected acyclic graph).

Definition 3.1

(Forest partition scheme) A forest partition scheme is a family of maps \(\pi _{n,k}:\ {\mathcal {D}}_{n,n+k}\rightarrow {\mathcal {F}}_{n,n+k}\) such that for all \(n\in \mathbb {N}\), \(k\in \mathbb {N}_0\), and all \(F\in {\mathcal {F}}_{n,n+k}\), there exists a graph \(R_{n,k}(F) \in {\mathcal {D}}_{n,n+k}\) with

$$\begin{aligned} \pi _{n,k}^{-1}\bigl ( \{F\}\bigr ) = \{G\in {\mathcal {D}}_{n,n+k} \mid E(F)\subset E(G)\subset E\bigl (R_{n,k}(F)\bigr )\} =: [F, R_{n,k}(F)]. \end{aligned}$$

To lighten the notation, we introduced partition schemes as families of maps on uncoloured structures. Notice, however, that partition schemes may be defined on coloured structures and may be allowed to depend on the colouring of the vertex set. Therefore, one could introduce families of maps \(\pi _{n,k}({\varvec{x}_{[n]}})\), indexed additionally by colourings \(\varvec{x}_{[n]}\in \mathbb X^n\) of \([n]=\{1,\ldots ,n\}\). The same graph with different colourings \(\varvec{x}_{[n]}\) of the vertices can be mapped onto different forests under such partition schemes.

The existence of forest partition schemes is ensured by the existence of a large class of tree partition schemes, e.g., the Penrose tree partition scheme (see [8, 33]; for colouring-dependent schemes see also [25, 31]).

Example 3.1

A particular forest partition scheme can be defined as follows: For a given multi-rooted graph, construct a connected graph from it by adding a ghost-vertex and connecting it to every root directly by an edge. Then apply the Penrose partition scheme to the resulting connected graph to obtain a spanning tree of this connected graph. Finally, by removing the ghost vertex as well as every edge incident to it, one gets a spanning forest of the initial multi-rooted graph. For the map given by this construction, the characterizing properties of a forest partition scheme follow from the corresponding properties of the Penrose tree partition scheme.

Naturally, the choice of the Penrose tree partition scheme in the example above is somewhat arbitrary; any tree partition scheme which does not “delete” any edge incident to the ghost vertex in the above construction yields a forest partition scheme via the same procedure.

Proposition 3.2

(Forest-graph equality) Let \((\pi _{n,k})_{n\in \mathbb {N},\, k\in \mathbb {N}_0}\) be a forest partition scheme and let \((R_{n,k})_{n\in \mathbb {N},\, k\in \mathbb {N}_0}\) provide the corresponding family of multi-rooted graphs as in Definition 3.1. Then

$$\begin{aligned} \psi _{n,n+k}(x_1,\ldots ,x_{n+k}) = \sum _{F\in {\mathcal {F}}_{n,n+k}} \prod _{\{i,j\}\in E(F)} f(x_i,x_j) \prod _{\{i,j\}\in E(R_{n,k}(F))\setminus E(F)} \bigl ( 1+ f(x_i,x_j)\bigr ) \end{aligned}$$

for all \(n\in \mathbb {N}\), \(k\in \mathbb {N}_0\), and \((x_1,\ldots , x_{n+k})\in \mathbb X^{n+k}\).

Proof

The proof is similar to the standard proof of the tree-graph equality  [8, Proposition 5]. We have

$$\begin{aligned} \psi _{n,n+k}(x_1,\ldots ,x_{n+k})&= \sum _{G\in {\mathcal {D}}_{n,n+k}} w\bigl (G,(x_1,\ldots ,x_{n+k})\bigr ) \\&= \sum _{F\in {\mathcal {F}}_{n,n+k}} \sum _{\begin{array}{c} G\in {\mathcal {D}}_{n,n+k}:\\ \pi _{n,k}(G) = F \end{array}} w\bigl (G,(x_1,\ldots ,x_{n+k})\bigr ) \\&= \sum _{F\in {\mathcal {F}}_{n,n+k}} \prod _{\{i,j\}\in E(F)} f(x_i,x_j) \prod _{\{i,j\}\in E(R_{n,k}(F))\setminus E(F)} \bigl ( 1+ f(x_i,x_j)\bigr ). \end{aligned}$$

\(\square \)

In the case of repulsive interactions, the forest-graph equality allows for a direct proof of the alternating sign property for the graph weights \(\psi _{n,n+k}\). For \(n=1\), it reduces to the well-known alternating sign property  [8, Eq. (2.8)]

$$\begin{aligned} \varphi _n^{\mathsf {T}}(x_1,\ldots , x_n) = (-1)^n \bigl | \varphi _n^{\mathsf {T}}(x_1,\ldots , x_n)\bigr | \end{aligned}$$
(3.1)

of the Ursell functions.
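For hard-core interactions, (3.1) can be verified by brute force on small instances. For example, for n pairwise overlapping objects one has \(f\equiv -1\) on every pair, and the classical value \(\varphi _n^{\mathsf {T}} = (-1)^{n-1}(n-1)!\) is recovered by summing \((-1)^{|E(G)|}\) over all connected graphs G. A sketch (our illustration):

```python
import math
from itertools import combinations

def connected(n, edges):
    # DFS connectivity check on the vertex set {0, ..., n-1}
    adj = {v: [] for v in range(n)}
    for a, b in edges:
        adj[a].append(b); adj[b].append(a)
    seen, stack = {0}, [0]
    while stack:
        for w in adj[stack.pop()]:
            if w not in seen:
                seen.add(w); stack.append(w)
    return len(seen) == n

def ursell_complete_overlap(n):
    # phi_n^T for n pairwise overlapping hard-core objects (f = -1 on every
    # pair): sum over connected graphs G on n vertices of (-1)^{|E(G)|}
    pairs = list(combinations(range(n), 2))
    total = 0
    for k in range(len(pairs) + 1):
        for edges in combinations(pairs, k):
            if connected(n, edges):
                total += (-1) ** len(edges)
    return total

for n in range(1, 5):
    assert ursell_complete_overlap(n) == (-1) ** (n - 1) * math.factorial(n - 1)
```

In particular the sign alternates with the number of non-root vertices, as stated in (3.1).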

Corollary 3.3

For non-negative potentials, we have

$$\begin{aligned} \psi _{n,n+k}(x_1,\ldots ,x_{n+k}) = (-1)^{k} \bigl |\psi _{n,n+k}(x_1,\ldots ,x_{n+k})\bigr | \end{aligned}$$

for all \(n\in \mathbb {N}\), all \(k\in \mathbb {N}_0\), and all \((x_1,\ldots ,x_{n+k})\in \mathbb X^{n+k}\).

Proof

Each forest \(F\in {\mathcal {F}}_{n,n+k}\) has exactly k edges. Indeed, the forest F consists of trees \(T_1,\ldots , T_n\). Let \(m_i\) be the number of vertices of the tree \(T_i\); thus \(m_1+\cdots + m_n =n+k\). Each tree \(T_i\) has exactly \(m_i -1\) edges, therefore the number of edges of the forest is given by \(\sum _{i=1}^n (m_i -1) = k\). Since \(f\le 0\) and \(1+f\ge 0\) for non-negative potentials, it follows that

$$\begin{aligned}&\psi _{n,n+k}(x_1,\ldots ,x_{n+k}) = (-1)^{k} \sum _{F\in \mathcal F_{n,n+k}} \prod _{\{i,j\}\in E(F)} |f(x_i,x_j)| \\&\quad \prod _{\{i,j\}\in E(R_{n,k}(F))\setminus E(F)} \bigl ( 1+ f(x_i,x_j)\bigr ), \end{aligned}$$

hence \((-1)^{k}\psi _{n,n+k}(x_1,\ldots , x_{n+k})\ge 0\). \(\square \)

We will use the alternating sign property to establish that—in the case of non-negative potentials—condition (i) in Theorem 2.1 is not only sufficient but also necessary for absolute convergence of \( \rho \).

We conclude this section with a lemma that is not needed for the proof of Theorem 2.1 but enters the analysis of hard-core models, see the proof of Lemma 3.8 below.

Lemma 3.4

For all \(n\in \mathbb {N}\), all \(k\in \mathbb {N}_0\), and all \((x_1,\ldots , x_{n+k})\in \mathbb X^{n+k}\), we have

$$\begin{aligned} \psi _{n,n+k}(x_1,\ldots ,x_{n+k})= & {} \prod _{1\le i < j \le n}\bigl ( 1+ f(x_i,x_j)\bigr ) \nonumber \\&\times \sum _{\{V_1,\ldots , V_r\}} \prod _{\ell =1}^r \Biggl ( \prod _{\begin{array}{c} 1 \le i \le n,\\ j\in V_\ell \end{array}} \bigl (1+f (x_i,x_j)\bigr ) -1 \Biggr ) \varphi _{|V_\ell |}^\mathsf T\bigl ((x_j)_{j\in V_\ell }\bigr ),\nonumber \\ \end{aligned}$$
(3.2)

where the sum runs over all set partitions \(\{V_1,\ldots , V_r\}\) of non-root vertices \(\{n+1,\ldots , n+k\}\).

Remark 3.1

The lemma allows for an alternative proof of the alternating sign property from Corollary 3.3, starting from the well-known alternating sign property of the Ursell function instead of the forest-graph equality. Indeed, the sign of every summand in the right-hand side of (3.2) is

$$\begin{aligned} (-1)^{r+ \sum _{i=1}^r (|V_i| -1)} = (-1)^{k}. \end{aligned}$$

Proof of Lemma 3.4

For \(n=1\), the lemma reduces to a well-known equality for the Ursell functions, see e.g. [11, Eq. (5.13)]. For \(n\ge 2\), the proof is similar; we provide the details for the reader’s convenience. Every multi-rooted graph \(G\in \mathcal D_{n,n+k}\) can be constructed in the following way. On the root set \(\{1,\ldots , n\}\) pick an arbitrary graph \(G_0\). On the complement of the root set do the following construction: Partition the set of the non-root vertices into r sets \(V_1,\ldots ,V_r\), \(r\le k\). For every block \(V_\ell \), pick a connected graph \(G_\ell \) with vertex set \(V_\ell \), and in addition a non-empty set of edges \( E_\ell \subset \bigl \{ \{i,j\}\mid i \in \{1,\ldots , n\}, \, j\in V_\ell \bigr \}. \) Then the graph G with vertices \(1,\ldots , n+k\) and edge set given by the union of \(E_1,\ldots , E_r\) and of the edge sets of \(G_0,G_1,\ldots , G_r\) is in \({\mathcal {D}}_{n,n+k}\), and its graph weight is

$$\begin{aligned} w(G;x_1,\ldots , x_{n+k}) = w(G_0;x_1,\ldots ,x_n) \prod _{\ell =1}^r \Biggl (\prod _{\{i,j\}\in E_\ell } f(x_i,x_j)\Biggr ) w\bigl (G_\ell ;(x_j)_{j\in V_\ell }\bigr ). \end{aligned}$$

Summation over \(G_0\) yields the factor \(\prod _{1\le i < j \le n} (1+f(x_i,x_j))\). Summation over the connected graphs \(G_\ell \) yields \(\varphi _{|V_\ell |}^{\mathsf {T}}(\varvec{x}_{V_\ell })\). Finally, summation over the edge sets \(E_\ell \) yields the factor \(\prod _{1\le i \le n,j \in V_\ell }(1+f(x_i,x_j)) - 1\). \(\square \)

Kirkwood–Salsburg Equations: Proof of Theorem 2.1

To prove our main result, Theorem 2.1, we will show that the activity expansions \(\varvec{\rho }\) satisfy the Kirkwood–Salsburg inequalities and, moreover, that equality holds for non-negative interactions. To do so, we will need to establish a recursive formula for the coefficients \(\psi _{n,n+k}\) of \(\rho _n\) given in terms of multi-rooted graphs.

Lemma 3.5

Let \(n\in \mathbb {N}\), \(k\in \mathbb {N}_0\) and \((x_1,\ldots , x_n) \in \mathbb X^n\). Abbreviate \(s= s(\varvec{x})\). For \(L\subset [k]\), let \(\ell \) denote the cardinality of L. Then for all \((y_1,\ldots ,y_k)\in \mathbb X^k\),

$$\begin{aligned} \psi _{n,n+k}(x_1,\ldots ,x_n,y_1,\ldots , y_k)&=\prod _{\begin{array}{c} 1\le i \le n:\\ i\ne s \end{array}} \bigl (1+f(x_{s}, x_i)\bigr ) \sum _{L\subset [k]} \Bigl (\prod _{i\in L} f(x_ {s}, y_i)\Bigr ) \\&\quad \quad \times \psi _{n-1+\ell , n-1+k}\bigl (x'_2,\ldots , x'_{n}, (y_i)_{i\in L}, (y_j)_{j\in [k]\setminus L} \bigr ). \end{aligned}$$

Furthermore, if \(n\ge 2\),

$$\begin{aligned} \psi _{n,n}(x_1,\ldots ,x_n) =\prod _{\begin{array}{c} 1\le i \le n:\\ i\ne s \end{array}} \bigl (1+f(x_{s}, x_i)\bigr )\, \psi _{n-1, n-1}\bigl (x'_2,\ldots , x'_{n} \bigr ). \end{aligned}$$

The lemma is proven in [14, Lemma 4.1] and holds true as well for pair potentials that may take negative values. The index set L corresponds to the non-root vertices adjacent to the selected vertex s. Similar recurrence relations are well known from the literature and have been employed in the context of both activity and density (virial) expansions for a long time (see, e.g., [20, Eq. 5]).

We now want to translate the recurrence relation for coefficients \(\psi _{n,n+k}\) from Lemma 3.5 into integral equations for partial sums and series. For a non-negative activity function, we set

$$\begin{aligned} {\tilde{\rho }}_n(x_1,\ldots , x_n;z):= z(x_1)\cdots z(x_n) \sum _{k=0}^{\infty } \frac{1}{k!}\int _{\mathbb X^k} \bigl |\psi _{n,n+k}(x_1,\ldots ,x_n,\varvec{y})\bigr | \lambda _z^k(\mathrm {d}\varvec{y}). \end{aligned}$$

Notice that \(\rho _n(x_1,\ldots ,x_n;z)\) is absolutely convergent if and only if \({\tilde{\rho }}_n(x_1,\ldots ,x_n;z)<\infty \), and, in the case of non-negative potentials,

$$\begin{aligned} {\tilde{\rho }}_n(x_1,\ldots , x_n;z) = (-1)^n \rho _n(x_1,\ldots , x_n;-z) \end{aligned}$$

holds due to the alternating-sign property from Corollary 3.3.

Let \(\varvec{{\tilde{S}}}_N(z) = ({\tilde{S}}_{N,n}(\cdot ;z))_{n\in \mathbb {N}}\) be the vector of partial sums given by

$$\begin{aligned} {\tilde{S}}_{N,n}(x_1,\ldots ,x_n;z):=z(x_1)\cdots z(x_n) \sum _{k=0}^{N-n} \frac{1}{k!}\int _{\mathbb X^k} \bigl | \psi _{n,n+k}(x_1,\ldots ,x_n,\varvec{y})\bigr | \lambda _z^k(\mathrm {d}\varvec{y}) \end{aligned}$$

if \(N \ge n\), and 0 otherwise. The summand for \(k=0\) is to be read as \(|\psi _{n,n}(x_1,\ldots ,x_n)|\).

Proposition 3.6

For general pair-interactions, we have

$$\begin{aligned} \varvec{{\tilde{\rho }}}(z) \le \varvec{e}_z+ {\tilde{K}}_z^s \varvec{\tilde{\rho }}(z) \end{aligned}$$
(3.3)

and

$$\begin{aligned} \varvec{{\tilde{S}}}_1(z) = \varvec{e}_z,\quad \varvec{{\tilde{S}}}_{N+1} \le \varvec{e}_z+{\tilde{K}}_z^s \varvec{{\tilde{S}}}_{N}(z)\quad (N\ge 1). \end{aligned}$$

Moreover, for non-negative potentials, we get the equalities

$$\begin{aligned} \varvec{{\tilde{\rho }}}(z) = \varvec{e}_z+ {\tilde{K}}_z^s \varvec{\tilde{\rho }}(z) \end{aligned}$$
(3.4)

and

$$\begin{aligned} \quad \varvec{{\tilde{S}}}_{N+1} = \varvec{e}_z+{\tilde{K}}_z^s \varvec{\tilde{S}}_{N}(z)\quad (N\ge 1). \end{aligned}$$

Proof

The equality \(\varvec{{\tilde{S}}}_1(z) = \varvec{e}_z\) follows from the definition of \(\varvec{{\tilde{S}}}_1(z)\) and \(\psi _{1,1}(x_1) =1\). For the recurrence relation, we employ arguments from [14, Sect. 4] and combine them with the alternating sign property from Corollary 3.3 to argue equality in the case of non-negative potentials.

Consider first \({\tilde{S}}_{N+1,n}(z)\) with \(2\le n\le N+1\). Define

$$\begin{aligned} \mathscr {R}_{n,\ell } (x_1,\ldots ,x_n;y_1,\ldots , y_\ell ) := z(x_s) \prod \limits _{i=2}^n(1+f(x_s,x'_i)) \prod _{i=1}^\ell \bigl | f(x_s, y_i)\bigr |. \end{aligned}$$

Fix \(n\ge 2\). Lemma 3.5 and the triangle inequality yield

$$\begin{aligned}&\bigl |\psi _{n,n+k}(x_1,\ldots ,x_n,y_1,\ldots , y_k) \bigr |\nonumber \\&\quad \le \sum _{L\subset [k]} \mathscr {R}_{n,\ell } (x_1,\ldots ,x_n; \varvec{y}_L)\, \bigl |\psi _{n-1+\ell , n-1+k}\bigl (x'_2,\ldots , x'_{n}, \varvec{y}_L, \varvec{y}_{[k]\setminus L}\bigr ) \bigr |, \end{aligned}$$
(3.5)

where \(\ell = \#L\). When we integrate over \(y_1,\ldots ,y_k\), all sets L with the same cardinality contribute the same, therefore

$$\begin{aligned}&\frac{1}{k!}\int _{\mathbb X^k} \bigl |\psi _{n,n+k}(x_1,\ldots ,x_n,y_1,\ldots , y_k) \bigr | \, \lambda _z^k(\mathrm {d}\varvec{y}) \\&\quad \le \sum _{\ell =0}^k \frac{1}{\ell ! (k-\ell )!} \int _{\mathbb X^k} \mathscr {R}_{n,\ell } (x_1,\ldots ,x_n; y_1,\ldots ,y_\ell )\, \bigl |\psi _{n-1+\ell , n-1+k}\bigl (x'_2,\ldots , x'_{n}, \varvec{y}\bigr ) \bigr |\, \lambda _z^k(\mathrm {d}\varvec{y}). \end{aligned}$$

Summing over \(k=0,\ldots ,N+1-n\) we obtain a double sum over k and \(\ell \). A change in summation indices from \((\ell ,k)\) to \((\ell ,m)=(\ell ,k-\ell )\) yields

$$\begin{aligned}&{\tilde{S}}_{N+1,n}(x_1,\ldots ,x_n;z) \le \sum _{\ell =0}^{N+1-n} \frac{1}{\ell !} \int _{\mathbb X^\ell } \mathscr {R}_{n,\ell } (x_1,\ldots ,x_n; y_1,\ldots ,y_\ell )\Bigl \{\cdots \Bigr \} \lambda _z^\ell \bigl (\mathrm {d}( y_1,\ldots y_\ell )\bigr ),\\&\quad \{\cdots \} = \sum _{m=0}^{N+1-n-\ell } \frac{1}{m!} \int _{\mathbb X^m} \, \bigl |\psi _{n-1+\ell , n-1+\ell +m}\bigl (x'_2,\ldots , x'_{n}, \varvec{y}\bigr ) \bigr |\, \lambda _z^m\bigl (\mathrm {d}(y_{\ell +1},\ldots y_{\ell +m})\bigr ). \end{aligned}$$

The term in curly braces is precisely \(\tilde{S}_{N,n-1+\ell }(x'_2,\ldots ,x'_n,y_1,\ldots ,y_\ell ;z)\). For \(\ell > N+1-n\), the function \({\tilde{S}}_{N,n-1+\ell }(\cdot ;z)\) is identically zero. It follows that

$$\begin{aligned} {\tilde{S}}_{N+1,n}(x_1,\ldots ,x_n;z) \le \sum _{\ell =0}^{\infty } \frac{1}{\ell !} \int _{\mathbb X^\ell } \mathscr {R}_{n,\ell } (x_1,\ldots ,x_n; \varvec{y}) \tilde{S}_{N,n-1+\ell }(x'_2,\ldots ,x'_n,\varvec{y}) \lambda _z^\ell \bigl (\mathrm {d}\varvec{y}\bigr ). \end{aligned}$$

This proves the inequality \({\tilde{S}}_{N+1,n}(\cdot ;z)\le \bigl ({\tilde{K}}_z^s \varvec{{\tilde{S}}}_{N}(z)\bigr )_n(\cdot )\). The cases \(n\ge N+2\) and \(n=1\) are treated in a similar fashion; we leave the details to the reader. For the equality \({\tilde{S}}_{N+1,n}(\cdot ;z)=\bigl ({\tilde{K}}_z^s \varvec{{\tilde{S}}}_{N}(z)\bigr )_n(\cdot )\) in the case of non-negative potentials, notice that for such potentials (3.5) holds with equality, due to the alternating-sign property from Corollary 3.3.

Finally, by passing to the limit \(N\rightarrow \infty \) in the recurrence relation for \(\varvec{{\tilde{S}}}_N(z)\), the inequality (3.3) for \(\varvec{{\tilde{\rho }}}(z)\) follows (and in the case of non-negative potentials the fixed point equation (3.4) is obtained). Notice that all exchanges of limits, sums and integrals are permitted by monotone convergence and because all terms involved are non-negative. \(\square \)

Proof of Theorem 2.1

For the implication (i) \(\Rightarrow \) (ii), suppose there exists a sequence \(\varvec{\xi }= (\xi _n)_{n\in \mathbb {N}}\) of measurable non-negative functions \(\xi _n:\mathbb X^n\rightarrow \mathbb {R}_+\) such that \( \varvec{e}_z + {\tilde{K}}_z^s \varvec{\xi }\le \varvec{\xi }\). We prove by induction over N that \(\varvec{{\tilde{S}}}_N(z) \le \varvec{\xi }\) for all \(N\in \mathbb {N}\). For \(N=1\), we have

$$\begin{aligned} \varvec{{\tilde{S}}}_1 = \varvec{e}_z \le \varvec{e}_z + {\tilde{K}}_z^s \varvec{\xi }\le \varvec{\xi }. \end{aligned}$$

If \(\varvec{{\tilde{S}}}_N\le \varvec{\xi }\) for some \(N\in \mathbb {N}\), then

$$\begin{aligned} \varvec{{\tilde{S}}}_{N+1} \le \varvec{e}_z+{\tilde{K}}_z^s \varvec{\tilde{S}}_{N}(z) \le \varvec{e}_z + {\tilde{K}}_z^s \varvec{\xi }\le \varvec{\xi }, \end{aligned}$$

where the first inequality holds by Proposition 3.6 and the second one due to the inductive hypothesis and the monotonicity of \({\tilde{K}}_z^s\) on non-negative functions.

This completes the induction and proves \(\varvec{{\tilde{S}}}_N\le \varvec{\xi }\) for all N. Passing to the limit \(N\rightarrow \infty \), we find \(\varvec{{\tilde{\rho }}} \le \varvec{\xi }\). This proves the absolute convergence (ii) as well as the bound (2.6).
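The monotone iteration used in this induction can be visualized in a scalar toy model (our illustration, not from the text): for a single abstract polymer incompatible only with itself, the Kirkwood–Salsburg map reduces to \(\xi \mapsto z(1+\xi )\), whose minimal non-negative fixed point \(z/(1-z)\) exists precisely for \(z<1\); the partial sums increase monotonically towards it.

```python
def iterate_ks(z, steps=200):
    # scalar analogue of S_{N+1} = e_z + K_z S_N with S_1 = e_z = z
    s, history = z, [z]
    for _ in range(steps):
        s = z * (1 + s)
        history.append(s)
    return history

h = iterate_ks(0.5)
assert all(a <= b for a, b in zip(h, h[1:]))  # monotone increase
assert abs(h[-1] - 1.0) < 1e-9                # converges to z/(1-z) = 1
```

For \(z\ge 1\) the same iteration diverges, matching the failure of condition (i) in that regime.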

It remains to show the implication (ii) \(\Rightarrow \) (i) under the additional assumption that the potential is non-negative. Suppose that \(\rho _n(x_1,\ldots ,x_n;z)\) is absolutely convergent, for all \(n\in \mathbb {N}\) and \((x_1,\ldots ,x_n)\in \mathbb X^n\). Then \({\tilde{\rho }}_n(x_1,\ldots ,x_n;z)\) is finite everywhere and we may set

$$\begin{aligned} \xi _n(x_1,\ldots ,x_n):= {\tilde{\rho }}_n(x_1,\ldots ,x_n;z). \end{aligned}$$

Proposition 3.6 yields \(\varvec{\xi }= \varvec{e}_z + \tilde{K}_z^s \varvec{\xi }\), hence a fortiori \(\varvec{\xi }\ge \varvec{e}_z + {\tilde{K}}_z^s \varvec{\xi }\), as a pointwise inequality for all vector entries. This proves (i).\(\square \)

Integral Equations for Hard-Core Models: Proof of Theorem 2.6

In this section we specialize to hard-core systems in the continuum as in Sect. 2.3 and use capital letters for objects \(X\in \mathbb X\). Let \({\mathcal {D}}^\mathrm {red}_{n,n+k}\subset {\mathcal {D}}_{n,n+k}\) be the collection of graphs \(G\in \mathcal D_{n,n+k}\) that have no edges linking any two root vertices \(i,j\in \{1,\ldots ,n\}\). Define \(\psi _{n,n+k}^\mathrm {red}\) in a similar way as \(\psi _{n,n+k}\) but with summation over graphs in \( \mathcal D^\mathrm {red}_{n,n+k}\). It is not difficult to check that

$$\begin{aligned} \psi _{n,n+k}(X_1,\ldots ,X_{n+k}) =\prod _{1\le i < j \le n} \bigl (1+ f(X_i,X_j)\bigr )\psi _{n,n+k}^\mathrm {red}(X_1,\ldots ,X_{n+k}). \end{aligned}$$

The reduced functions \(\psi _{n,n+k}^\mathrm {red}\) satisfy recurrence relations similar to Lemma 3.5. Define

$$\begin{aligned}&g(X_1;X_2,\ldots , X_n;Y_1,\ldots , Y_k):= \prod _{j=1}^{k}f(X_1,Y_j) \prod _{1\le i< j\le k}\bigl (1+f(Y_i,Y_j)\bigr ) \nonumber \\&\quad \quad \prod _{\begin{array}{c} 2\le i\le n\\ 1\le j\le k \end{array}} \bigl (1+f(X_i,Y_j)\bigr ). \end{aligned}$$
(3.6)

Remember the indicator I from (2.16) and notice

$$\begin{aligned} g(X_1;X_2,\ldots , X_n;Y_1,\ldots , Y_k)=(-1)^k I(X_1; X_2\cup \cdots \cup X_n; Y_1,\ldots , Y_k). \end{aligned}$$
(3.7)

Lemma 3.7

For all \(k,n\in \mathbb {N}\), we have

$$\begin{aligned}&\psi _{n,n+k}^\mathrm {red}(X_1,\ldots , X_n,Y_1,\ldots , Y_k) \\&\quad = \sum _{L\subset [k]}g\bigl (X_s;X'_2,\ldots , X'_n;(Y_j)_{j\in L}\bigr ) \psi _{n-1+\ell ,n-1+k}^\mathrm {red}\bigl (X'_2,\ldots , X'_n, (Y_j)_{j\in L}, (Y_j)_{j\in [k]\setminus L}\bigr ). \end{aligned}$$

The proof is based on combinatorial considerations similar to the proof of Lemma 4.1 in [14]; we leave the details to the reader. Here \(\ell \) denotes the cardinality of L, \(X_s\) is a distinguished root and \(X'_2,\ldots , X'_n\) is an enumeration of the remaining roots (cf. the selection rules introduced before (3.9)). The lemma holds true for arbitrary subsets of \(\mathbb {R}^d\): the \(X_i\)’s and \(Y_j\)’s need not be in \(\mathbb X\). That applies to our next result as well.

Lemma 3.8

For all \(k,n\in \mathbb {N}\), we have

$$\begin{aligned} \psi _{n,n+k}^\mathrm {red}(X_1,\ldots ,X_n,Y_1,\ldots ,Y_{k}) = \varphi _{1+k}^{\mathsf {T}}(X_1\cup \cdots \cup X_n,Y_1,\ldots , Y_k). \end{aligned}$$

Proof

Revisiting the proof of Lemma 3.4, we see that

$$\begin{aligned}&\nonumber \psi _{n,n+k}^\mathrm {red}(X_1,\ldots ,X_n,Y_1,\ldots ,Y_k) \\ {}&= \sum _{\{V_1,\ldots , V_r\}} \prod _{\ell =1}^r \Biggl ( \prod _{\begin{array}{c} 1 \le i \le n,\\ j\in V_\ell \end{array}} \bigl (1+f (X_i,Y_j)\bigr ) -1 \Biggr ) \varphi _{|V_\ell |}^\mathsf T\bigl ((Y_j)_{j\in V_\ell }\bigr ), \end{aligned}$$
(3.8)

where the sum runs over all set partitions \(\{V_1,\ldots , V_r\}\) of non-root vertices \(\{n+1,\ldots , n+k\}\). For hard-core interactions, the term in parentheses is equal to minus the indicator that \(D:=X_1\cup \cdots \cup X_n\) is intersected by at least one \(Y_j\), \(j\in V_\ell \). Thus

$$\begin{aligned} \psi _{n,n+k}^\mathrm {red}(X_1,\ldots ,X_n,Y_1,\ldots ,Y_k) = \sum _{\{V_1,\ldots , V_r\}} \prod _{\ell =1}^r \bigl ( - \mathbb {1}_{\{\exists j\in V_\ell :\, Y_j \cap D\ne \varnothing \}}\bigr ) \varphi _{|V_\ell |}^{\mathsf {T}}\bigl ((Y_j)_{j\in V_\ell }\bigr ). \end{aligned}$$

Using again (3.8), we deduce

$$\begin{aligned} \psi _{n,n+k}^\mathrm {red}(X_1,\ldots ,X_n,Y_1,\ldots ,Y_k) = \psi _{1,1+k}(D,Y_1,\ldots , Y_k) = \varphi _{1+k}^{\mathsf {T}} (D,Y_1,\ldots ,Y_k). \end{aligned}$$

\(\square \)

Lemma 3.9

Let \(k\in \mathbb {N}\) and let \(D_0,D_1\) be two disjoint subsets of \(\mathbb {R}^d\), with \(D_0\ne \varnothing \). Then

$$\begin{aligned}&\varphi _{1+k}^{\mathsf {T}} (D_0\cup D_1,Y_1,\ldots , Y_k) \\&\quad = \sum _{L\subset [k]} (-1)^\ell I\bigl ( D_0;D_1;(Y_i)_{i\in L}\bigr )\varphi _{1+k-\ell }^{\mathsf {T}} \Bigl (D_1\cup \bigl ( \bigcup _{i\in L} Y_i\bigr ), (Y_j)_{j\in [k]\setminus L}\Bigr ), \end{aligned}$$

where the sum is taken over all subsets \(L\subset [k]\) and \(\ell \) denotes the cardinality of L.

Proof

The claim of the lemma follows from Eq. (3.7), Lemmas 3.7 and 3.8. \(\square \)

For \(D\in \mathbb D_\varepsilon \) with \(E_1,\ldots ,E_n\in \mathbb E_\varepsilon \) and \(C(D)=\{E_1,\ldots , E_n\}\), let

$$\begin{aligned} {\tilde{T}}(D;z):=1+\sum _{k=1}^\infty \frac{1}{k!}\int _{\mathbb X^k} \bigl |\varphi _{1+k}^{\mathsf {T}}(D,Y_1,\ldots ,Y_k)\bigr | \lambda _z^k(\mathrm {d}\varvec{Y}), \end{aligned}$$

\({\tilde{T}}_1(D;z):= \delta _{n,1}(\{E_1,\ldots ,E_n\})\), and for \(N \ge 2\),

$$\begin{aligned} {\tilde{T}}_N\bigl (D;z\bigr ) :=\mathbb {1}_{\{n\le N\}}+\sum _{k=1}^{N-n} \frac{1}{k!}\int _{\mathbb X^k} \mathbb {1}_{\{n+\sum _{i=1}^k |C(Y_i)|\le N\}} \bigl |\varphi _{1+k}^\mathsf T(D,Y_1,\ldots ,Y_k)\bigr | \lambda _z^k(\mathrm {d}\varvec{Y}). \end{aligned}$$

Although the value of \({\tilde{T}}(D;z)\) (if the series converges) does not depend on the choice of the chopping map, notice that the value of \({\tilde{T}}_N(D;z)\) clearly does depend on \(C(D)=\{E_1,\ldots ,E_n\}\) and not only on \(E_1\cup \ldots \cup E_n\) — due to the constraint on the number of snippets.

A selection rule is a map s from collections of disjoint snippets \(\mathbb D_\varepsilon \) to \(\mathbb E_\varepsilon \) such that

$$\begin{aligned} s(\{E_1,\ldots ,E_n\}) \in \{E_1,\ldots , E_n\}, \end{aligned}$$

i.e., \(s(\cdot )\) selects one of the snippets. We use the suggestive but somewhat abusive notation \(E_s \) for the selected snippet, and let \(E'_2,\ldots , E'_n\) be any enumeration of the remaining snippets. If \(\xi (\cdot )\) is a function from \(\mathbb D_\varepsilon \) to \(\mathbb {R}_+\) that satisfies the measurability assumption from Theorem 2.6, define a new function \(\tilde{\kappa }_z^s\xi \) (possibly assuming the value “\(\infty \)”) by setting

$$\begin{aligned} \nonumber&({\tilde{\kappa }}_z^s\xi )\bigl ( D\bigr ) := \mathbb {1}_{\{n\ge 2\}}\, \xi \bigl (E'_2\cup \ldots \cup E'_n\bigr ) \\&\quad +\sum _{k=1}^\infty \frac{1}{k!} \int _{\mathbb X^k} I(E_s;E'_2\cup \cdots \cup E'_n; Y_1,\ldots ,Y_k) \xi \Bigl (E'_2\cup \ldots \cup E'_n\cup Y_1\cup \ldots \cup Y_k \Bigr )\lambda _z^k(\mathrm {d}\varvec{Y}) \end{aligned}$$
(3.9)

for \(D\in \mathbb D_\varepsilon \) with \(E_1,\ldots ,E_n\in \mathbb E_\varepsilon \) and \(C(D)=\{E_1,\ldots , E_n\}\). Furthermore, let \(e(D):= \delta _{n,1}(\{E_1,\ldots ,E_n\})\) be the indicator that D is a single snippet.

Let z be a non-negative activity such that for every non-empty \(D\in \mathbb D_\varepsilon \) the series \({\tilde{T}}\bigl (D;z\bigr )\) converges absolutely. Notice that the topology induced by the Hausdorff distance is equivalent to the myopic topology and the map \({\mathscr {K}}'\ni F\mapsto \mathbb {1}_{\{F\cap B\ne \varnothing \}}=-f(F,B)\) is measurable with respect to the myopic topology for all compact subsets B (see [21]). Measurability of \({\mathscr {K}}'\ni F\mapsto \tilde{T}(D\cup F ;z)\) for every \(D\in {\mathscr {K}}'\) can be concluded, e.g., by representing the series \({\tilde{T}}(D\cup F;z)\) as in Eq. (2.15). Since \(\tilde{T}(D\cup F ;z)=\tilde{T}({\overline{D}}\cup F ;z)\) for every \(D\in \mathbb D_\varepsilon \), its topological closure \({\overline{D}}\) in \(\mathbb {R}^d\) and every \(F\in {\mathscr {K}}'\) (by our assumption that the boundaries of snippets are \(\lambda \)-null sets), the measurability of \(\mathscr {K}'\ni F\mapsto \tilde{T}(D\cup F ;z)\) for all \(D\in \mathbb D_\varepsilon \) follows.

The next result is an analogue of Proposition 3.6.

Proposition 3.10

We have

$$\begin{aligned} {\tilde{T}}\bigl (\cdot ;z\bigr ) = e(\cdot ) +{\tilde{\kappa }}_z^s \tilde{T}(\cdot ;z). \end{aligned}$$

Moreover \({\tilde{T}}_1(\cdot ;z) = e(\cdot )\) and for \(N\ge 1\)

$$\begin{aligned} {\tilde{T}}_{N+1}\bigl (\cdot ;z\bigr ) = e(\cdot ) + (\tilde{\kappa }_z^s {\tilde{T}}_N)\bigl (\cdot ;z\bigr ). \end{aligned}$$

The proposition follows from Lemma 3.9 by arguments similar to the proof of Proposition 3.6, therefore the proof is omitted.

Proof of Theorem 2.6

To show the implication (ii) \(\Rightarrow \) (i), suppose that \(T(D;z)\) is absolutely convergent and thus \({\tilde{T}}(D;z)\) is convergent for all non-empty \(D\in \mathbb D_\varepsilon \). Moreover, \({\tilde{T}}(D;z)\) is uniformly bounded from below by 1 and does not depend on the choice of the chopping map C. We set

$$\begin{aligned} a(D):= \log \tilde{T}(D;z)\ge 0 \end{aligned}$$

for non-empty \(D\in \mathbb D_\varepsilon \) and \(a(\varnothing ) := 0\). Furthermore, for \(D\in \mathbb D_\varepsilon \) with \(E_1,\ldots ,E_m\in \mathbb E_\varepsilon \) and \(C(D)=\{E_1,\ldots , E_m\}\), let \(E_s\in \mathbb E_\varepsilon \) be given by \(E_{s(\{E_1,\ldots ,E_m\})}\) for some selection rule \(s(\cdot )\) and set \(D':=D\backslash E_s\). Exploiting the fixed point equation for \({\tilde{T}}(\cdot ;z)\) from Proposition 3.10, we get

$$\begin{aligned} \mathrm {e}^{a(D')} + \sum _{k=1}^\infty \frac{1}{k!}\int _{\mathbb X^k}I(E_s;D';Y_1,\ldots , Y_k) \mathrm {e}^{a(D'\cup Y_1\cup \cdots \cup Y_k)} \lambda _z^k(\mathrm {d}\varvec{Y}) \le \mathrm {e}^{a(E_s\cup D')}. \end{aligned}$$

Item (i) of Theorem 2.6 follows upon multiplication with \(\exp (-a(D'))\) on both sides (in fact, we have shown that the inequality from item (i) holds for every choice of \(s\in \{1,\ldots ,m\}\)).

The implication (i) \(\Rightarrow \) (ii) follows from Proposition 3.10 by an induction over N similar to the proof of Theorem 2.1 on p. 20. Indeed, check that for the induction step it is sufficient that the corresponding system of Kirkwood–Salsburg inequalities holds for all finite disjoint unions of snippets (for any choice of the chopping map C, the snippet-size \(\varepsilon >0\) and the selection rule s). Bound (2.17) is then established using the triangle inequality, the alternating sign property, and Eq. (2.15). \(\square \)

Recurrence Relations for Subset Polymers: Proof of Theorem 2.7

Just as in the continuous case, we will show that the expansions \(\tilde{T}\) solve a system of integral equations in the discrete setup. We do so by providing a recursive formula for the corresponding coefficients \(\varphi ^{\mathsf {T}}_{1+k}\).

Lemma 3.11

For all finite subsets \(D'\subset \mathbb {Z}^d\), all \(x\in \mathbb {Z}^d\setminus D'\), and all \(k\in \mathbb {N}\),

$$\begin{aligned}&\varphi _{1+k}^{\mathsf {T}}(D'\cup \{x\}, Y_1,\ldots , Y_k) \\&\quad = \varphi _{1+k}^{\mathsf {T}}(D', Y_1,\ldots , Y_k) + \sum _{i=1}^k \bigl ( - \mathbb {1}_{\{Y_i\ni x,\, Y_i \cap D' = \varnothing \}}\bigr ) \varphi _{k}^{\mathsf {T}} \bigl (D'\cup Y_i, (Y_j)_{j\ne i}\bigr ) \end{aligned}$$

with \(\varphi _1^{\mathsf {T}} \equiv 1\).

Proof

Notice that the analogue of Lemma 3.9 holds in the discrete setup of subset polymers as well, in particular for the choice \(D_0:=\{x\}\) and \(D_1:=D'\). However, since two disjoint polymers \(Y_1\) and \(Y_2\) cannot both intersect the same monomer x, only the summands corresponding to \(L=\varnothing \) and \(\vert L\vert =1\) in the sum on the right-hand side of the identity in Lemma 3.9 provide non-trivial contributions. \(\square \)
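Lemma 3.11 is a purely combinatorial identity and can be tested directly on small instances by computing the Ursell functions \(\varphi ^{\mathsf {T}}_n\) as sums over connected graphs of products of hard-core Mayer factors. The following sketch is ours (Python; polymers are encoded as frozensets of lattice sites, and all function names are our own):

```python
from itertools import combinations

def f(A, B):
    """Hard-core Mayer function: -1 if the two subsets overlap, 0 otherwise."""
    return -1.0 if A & B else 0.0

def connected(n, edges):
    """Check that the graph on vertices 0,...,n-1 with the given edges is connected."""
    adj = {i: set() for i in range(n)}
    for i, j in edges:
        adj[i].add(j); adj[j].add(i)
    seen, stack = {0}, [0]
    while stack:
        for w in adj[stack.pop()] - seen:
            seen.add(w); stack.append(w)
    return len(seen) == n

def ursell(polys):
    """phi^T_n(polys): sum over connected graphs of the product of Mayer factors."""
    n = len(polys)
    if n == 1:
        return 1.0
    all_edges = list(combinations(range(n), 2))
    total = 0.0
    for r in range(n - 1, len(all_edges) + 1):
        for E in combinations(all_edges, r):
            if connected(n, E):
                w = 1.0
                for i, j in E:
                    w *= f(polys[i], polys[j])
                total += w
    return total

def lhs(Dp, x, Ys):
    """Left-hand side of Lemma 3.11: phi^T_{1+k}(D' u {x}, Y_1, ..., Y_k)."""
    return ursell([Dp | {x}] + Ys)

def rhs(Dp, x, Ys):
    """Right-hand side of Lemma 3.11."""
    val = ursell([Dp] + Ys)
    for i, Y in enumerate(Ys):
        if x in Y and not (Y & Dp):
            val -= ursell([Dp | Y] + [Z for j, Z in enumerate(Ys) if j != i])
    return val
```

For instance, with \(D'=\{0\}\), \(x=1\) and \((Y_1,Y_2)=(\{1,2\},\{2,3\})\) both sides evaluate to 1; with \((Y_1,Y_2)=(\{1\},\{1,2\})\) both evaluate to 2.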

Let D be a finite non-empty subset of \(\mathbb {Z}^d\) and z a non-negative activity function. Set

$$\begin{aligned} {\tilde{T}}(D;z):= 1+ \sum _{k=1}^\infty \frac{1}{k!}\sum _{(Y_1,\ldots , Y_k) \in \mathbb X^k}\bigl | \varphi _{1+k}^{\mathsf {T}}( D,Y_1,\ldots , Y_k) \bigr |z(Y_1) \cdots z(Y_k) \end{aligned}$$

and for \(N\in \mathbb {N}\),

$$\begin{aligned}&{\tilde{T}}_N(D;z):= \mathbb {1}_{\{\vert D\vert \le N\}}\nonumber \\&\quad + \sum _{k=1}^\infty \frac{1}{k!}\sum _{(Y_1,\ldots , Y_k) \in \mathbb X^k} \mathbb {1}_{\{ |D| + \sum _{i=1}^k |Y_i| \le N\}} \bigl \vert \varphi _{1+k}^{\mathsf {T}}( D,Y_1,\ldots , Y_k) \bigr \vert z(Y_1) \cdots z(Y_k).\nonumber \\ \end{aligned}$$
(3.10)

Furthermore, we use the convention \({\tilde{T}}_N(\varnothing ;z) =1\) for all \(N\ge 1\).

Again, we lift the established recurrence relations on the level of coefficients (given by Lemma 3.11) to the level of partial sums and series, deriving a system of integral equations for those. The following result is an analogue of Proposition 3.6 and Proposition 3.10 for subset polymers.

Proposition 3.12

Under the assumptions of Lemma 3.11, the identities

$$\begin{aligned} {\tilde{T}}(D'\cup \{x\};z) = {\tilde{T}}(D';z) + \sum _{\begin{array}{c} Y\ni x:\\ Y\cap D'=\varnothing \end{array}} z(Y) {\tilde{T}}(D'\cup Y;z) \end{aligned}$$

and for \(N\in \mathbb {N}\),

$$\begin{aligned} {\tilde{T}}_{N+1}(D'\cup \{x\};z) = {\tilde{T}}_N(D';z) + \sum _{\begin{array}{c} Y\ni x:\\ Y\cap D'=\varnothing \end{array}} z(Y) \tilde{T}_N(D'\cup Y;z), \end{aligned}$$

hold for any non-negative activity z.

Remark 3.2

Notice that the first identity in Proposition 3.12 is just a sign-flipped version of the standard Kirkwood–Salsburg equations for the reduced correlation functions found in [1].

Proof

Lemma 3.11 yields

$$\begin{aligned}&\mathbb {1}_{\{ |D'\cup \{x\}| + \sum _{i=1}^k |Y_i| \le N+1\}} \varphi _{1+k}^{\mathsf {T}}( {D'\cup \{x\}},Y_1,\ldots , Y_k) \\&\qquad = \mathbb {1}_{\{ |D'| + \sum _{i=1}^k |Y_i| \le N\}} \varphi _{1+k}^{\mathsf {T}}(D', Y_1,\ldots , Y_k) \\&\qquad \qquad + \sum _{i=1}^k \bigl ( - \mathbb {1}_{\{Y_i \ni x,\ Y_i \cap D'=\varnothing \}}\bigr ) \mathbb {1}_{\{ |D'\cup Y_i| + \sum _{j\ne i} |Y_j| \le N\}} \varphi _{k}^{\mathsf {T}}( D'\cup Y_i, (Y_j)_{j\ne i}). \end{aligned}$$

The proof of the recurrence relation for \({\tilde{T}}_N(\cdot ;z)\) is concluded by exploiting the alternating sign property of the Ursell functions, summing over k and \(Y_1,\ldots , Y_k\), and using the symmetry of \(\varphi _k^{\mathsf {T}}\). The recurrence relation for \({\tilde{T}}(\cdot ;z)\) follows by passing to the limit \(N\rightarrow \infty \). \(\square \)
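The recurrence of Proposition 3.12 (in its truncated form) can be verified by brute force on a small toy system, since the indicator in (3.10) makes every truncation a finite sum. The sketch below is ours (Python; the monomer–dimer polymer set, the window and the activities are arbitrary choices for illustration, not from the paper); it realizes \(\vert \varphi ^{\mathsf {T}}_{1+k}\vert \) as \((-1)^k\varphi ^{\mathsf {T}}_{1+k}\) via the alternating sign property.

```python
from itertools import combinations, product
from math import factorial

def connected(n, edges):
    """Check that the graph on vertices 0,...,n-1 with the given edges is connected."""
    adj = {i: set() for i in range(n)}
    for i, j in edges:
        adj[i].add(j); adj[j].add(i)
    seen, stack = {0}, [0]
    while stack:
        for w in adj[stack.pop()] - seen:
            seen.add(w); stack.append(w)
    return len(seen) == n

def ursell(polys):
    """phi^T_n: sum over connected graphs of products of hard-core Mayer factors."""
    n = len(polys)
    if n == 1:
        return 1.0
    all_edges = list(combinations(range(n), 2))
    total = 0.0
    for r in range(n - 1, len(all_edges) + 1):
        for E in combinations(all_edges, r):
            if connected(n, E):
                w = 1.0
                for i, j in E:
                    w *= -1.0 if polys[i] & polys[j] else 0.0
                total += w
    return total

# toy system: monomers and dimers on a window of Z, wide enough for N = 3
POLYS = [frozenset({i}) for i in range(-4, 7)] + \
        [frozenset({i, i + 1}) for i in range(-4, 6)]

def z(Y):
    """Activities: 0.1 per monomer, 0.05 per dimer (arbitrary illustrative values)."""
    return 0.1 if len(Y) == 1 else 0.05

def T_N(D, N):
    """Truncated sum (3.10), with |phi^T_{1+k}| realized as (-1)^k phi^T_{1+k}."""
    D = frozenset(D)
    if not D:
        return 1.0
    total = 1.0 if len(D) <= N else 0.0
    for k in range(1, N - len(D) + 1):
        for Ys in product(POLYS, repeat=k):
            if len(D) + sum(len(Y) for Y in Ys) > N:
                continue
            w = (-1) ** k * ursell([D] + list(Ys)) / factorial(k)
            for Y in Ys:
                w *= z(Y)
            total += w
    return total
```

With \(D'=\{0\}\), \(x=2\) and \(N=3\), both sides of the recurrence evaluate to the same value (here 1.43) for these activities.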

Proof of Theorem 2.7

To prove the implication \((i)\Rightarrow (ii)\), suppose that condition (i) is satisfied for some set function \(a(\cdot )\). Proceeding as in the proof of Theorem 2.1 again, we prove by induction over N that

$$\begin{aligned} \tilde{T}_N(D;z) \le \exp ( a (D)), \end{aligned}$$
(3.11)

for all finite subsets \(D\subset \mathbb {Z}^d\). For \(N=1\), the inequality reads \(\mathbb {1}_{\{\vert D\vert \le 1\}} \le \exp (a(D))\) and it is true because \(a(D)\ge 0\). Now, suppose it holds true for some \(N\ge 1\) and all D. Let \(\widehat{D}\subset \mathbb {Z}^d\) be finite. If \(\widehat{D}\) is empty, then \({\tilde{T}}_{N+1}(\widehat{D};z) = 1 \le \exp (a(\widehat{D}))\). If \(\widehat{D}\) is not empty, let x be any element of \(\widehat{D}\) and let \(D':=\widehat{D}\backslash \{x\}\). Then Proposition 3.12 yields

$$\begin{aligned} {\tilde{T}}_{N+1}(\widehat{D};z) = {\tilde{T}}_N(D';z) + \sum _{\begin{array}{c} Y\ni x,\\ Y\cap D'=\varnothing \end{array}} z(Y) \tilde{T}_N(D'\cup Y;z). \end{aligned}$$

By the induction hypothesis and condition (2.18),

$$\begin{aligned} {\tilde{T}}_{N+1}(\widehat{D};z) \le \mathrm {e}^{a(D')} + \sum _{\begin{array}{c} Y\ni x,\\ Y\cap D'=\varnothing \end{array}} z(Y) \mathrm {e}^{a(D'\cup Y)} \le \mathrm {e}^{a(D'\cup \{x\})} = \mathrm {e}^{a(\widehat{D})}. \end{aligned}$$

This completes the inductive proof of (3.11). Passing to the limit \(N\rightarrow \infty \), we get \({\tilde{T}}(D;z) \le \exp (a(D)) <\infty \).

To prove the converse implication \((ii)\Rightarrow (i)\), suppose that \(T(D;z)\) is absolutely convergent for all finite subsets D. Then \({\tilde{T}}(D;z)<\infty \) and Proposition 3.12 yields

$$\begin{aligned} {\tilde{T}}(D\cup \{x\}; z) = {\tilde{T}}(D;z) + \sum _{\begin{array}{c} Y\ni x,\\ Y\cap D=\varnothing \end{array}} z(Y) {\tilde{T}}(D\cup Y;z). \end{aligned}$$

Set \(a(D):= \log {\tilde{T}}(D;z)\). Then \(a(D)\ge 0\) because \(\tilde{T}(D;z) \ge 1\), moreover

$$\begin{aligned} \mathrm {e}^{a(D\cup \{x\})} = \mathrm {e}^{a(D)} + \sum _{\begin{array}{c} Y\ni x,\\ Y\cap D=\varnothing \end{array}} z(Y) \mathrm {e}^{a(D\cup Y)} \end{aligned}$$

and the inequality (2.18) follows. \(\square \)

Notice that the preceding results of Sect. 3.4 can be generalized by proving a more general version of Lemma 3.11—a direct analogue of Lemma 3.9, where we consider two arbitrary finite subsets \(D_0\subset \mathbb {Z}^d\) and \(D_1\subset \mathbb {Z}^d\), instead of the special case where one of the subsets is a monomer. Naturally, one can view configurations of polymers not only as configurations of monomers but as configurations of disjoint snippets of arbitrary shape and derive from the generalized version of Lemma 3.11 a system of Kirkwood–Salsburg equations different from the one in Proposition 3.12, equations which involve terms of higher order in the activity z. Those equations in turn lead to the following alternative to Theorem 2.7:

Theorem 3.13

Let \((z(X))_{X\in \mathbb X}\) be a non-negative activity. The following two conditions are equivalent:

  1. (i)

    There exists a function \(a(\cdot )\) from the finite subsets of \(\mathbb {Z}^d\) to \([0,\infty )\) such that \(a(\varnothing )=0\) and the following system of inequalities is satisfied: For all finite, non-empty subsets \(D\subset \mathbb {Z}^d\) there exists a subset \(D_0\subset D\) such that—setting \(D_1:=D\backslash D_0\)—we have

    $$\begin{aligned} \sum \limits _{k\ge 1}\sum _{\{Y_1,\ldots ,Y_k\}\subset \mathbb X} z(Y_1)\ldots z(Y_k) \mathrm {e}^{a(D_1\cup Y_1\cup \ldots \cup Y_k) - a(D_1)} \le \mathrm {e}^{a(D_1\cup D_0) - a(D_1)}-1, \end{aligned}$$

    where the sum runs over sets of mutually disjoint polymers \(\{Y_1,\ldots , Y_k\}\subset \mathbb X\) such that \(Y_i\cap D_0\ne \varnothing \) and \(Y_i\cap D_1= \varnothing \) for all \(i\in \{1,\ldots ,k\}\).

  2. (ii)

    \(T(D;z)\) is absolutely convergent for all finite subsets \(D\subset \mathbb {Z}^d\).

Moreover, if one (and hence both) of the equivalent conditions holds true, then, for all finite subsets \(D\subset \mathbb {Z}^d\), we have

$$\begin{aligned} \bigl | \log T(D;z)\bigr |\le \sum _{k=1}^\infty \frac{1}{k!}\sum _{(Y_1,\ldots , Y_k) \in \mathbb X^k} \mathbb {1}_{\{\exists i:\, Y_i \cap D\ne \varnothing \}} \bigl | \varphi _{k}^\mathsf T(Y_1,\ldots , Y_k) \bigr | z(Y_1)\cdots z(Y_k) \le a(D). \end{aligned}$$

The details of the proof are left to the reader as an exercise. Notice that the sufficient condition for convergence given by Theorem 3.13 is more general than the one given by Theorem 2.7. However, all the proofs of sufficient conditions for systems of subset polymers in Sect. 4 use the special case of Theorem 2.7.

Application to Concrete Hard-Core Models

Our main results (Theorems 2.1, 2.6 and 2.7) provide characterizations of the domain of absolute convergence for the activity expansions \(\rho _n(x_1,\ldots ,x_n;z)\) from which well-known classical criteria are easily recovered (Corollaries 2.2, 2.3 and 2.8). In this section, we illustrate how our convergence conditions provide new, “practitioner-type” sufficient conditions in concrete hard-core models, both discrete and continuous. Our goal here is not to improve on the best available conditions, but to provide lower bounds on the convergence radii that are computationally tractable. In the one-dimensional setup of the Tonks gas, however, we are able to go as far as to recover the characterization of absolute convergence from [13].

Single-Type Subset Polymers in \(\mathbb {Z}^d\)

Consider the setup of subset polymers from Sect. 2.4. Suppose there is some finite non-empty set \(S\subset \mathbb {Z}^d\) and a scalar \(z>0\) such that

$$\begin{aligned} z(X) = {\left\{ \begin{array}{ll} z, &{}\quad X\text { is a translate of }S,\\ 0, &{}\quad \text {otherwise}. \end{array}\right. } \end{aligned}$$
(4.1)

We call polymers with non-zero activity active polymers. Define

$$\begin{aligned} V(D):= \bigl |\{ X\in \mathbb X\mid z(X)>0,\, X\cap D \ne \varnothing \}\bigr |, \end{aligned}$$

the number of active polymers intersecting a finite domain \(D\subset \mathbb {Z}^d\). Notice \(V(\{x\}) = |S|\), for all \(x\in \mathbb {Z}^d\).

Theorem 4.1

Let \(z(\cdot )\) be the activity function from (4.1). Suppose there exists \(\alpha >0\) such that

$$\begin{aligned} |S|\, \mathrm {e}^{\alpha V(S)} z\le \mathrm {e}^{\alpha |S|}-1. \end{aligned}$$
(4.2)

Then \(T(D;z)\) is absolutely convergent for all finite subsets \(D\subset \mathbb {Z}^d\), thus the activity expansions \(\rho _n(x_1,\ldots ,x_n;z)\) converge absolutely for all \(n\in \mathbb {N}\) and all \((x_1,\ldots , x_n)\in \mathbb X^n\).

Remark 4.1

Notice that Theorem 4.1 improves on the lower bounds for the convergence radii given by Kotecký–Preiss and by Gruber–Kunz. The improvement over the Gruber–Kunz condition is achieved by a more sophisticated choice of the ansatz function a in the proof of the theorem. However, although we do not have a general proof that the result by Fernández–Procacci is stronger, notice that in all non-pathological examples we considered (e.g., non-overlapping dimers or cubes) Fernández–Procacci provides better bounds than Theorem 4.1.

Example 4.1

(Hypercubes) If \(S= \{1,\ldots , k\}^d\) with \(k\in \mathbb {N}\), condition (4.2) becomes

$$\begin{aligned} z\le \sup _{\alpha > 0} \frac{\exp (\alpha k^d )-1}{k^d \exp ( \alpha (2k-1)^d)}. \end{aligned}$$

Carrying out the optimization over \(\alpha \) yields the condition

$$\begin{aligned} (2k-1)^d z \le \Bigl (1-\frac{1}{(2 - 1/k)^d}\Bigr )^{(2-1/k)^d -1}. \end{aligned}$$

In the limit \(d\rightarrow \infty \) at fixed \(k\ge 2\), the right-hand side converges from above to the familiar bound \(1/\mathrm {e}\).
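The optimization over \(\alpha \) in Example 4.1 is easy to double-check numerically. In the sketch below (Python; function names are ours), the stationary point is computed from \(\mathrm {e}^{\alpha ^* a}=b/(b-a)\) with \(a=|S|=k^d\) and \(b=V(S)=(2k-1)^d\), and the resulting value of z is compared with the closed-form condition.

```python
import math

def z_bound(alpha, a, b):
    """z allowed by (4.2) at a given alpha: (e^{alpha a} - 1) / (a e^{alpha b})."""
    return math.expm1(alpha * a) / (a * math.exp(alpha * b))

def best_z(k, d):
    """Optimize over alpha; the stationary point solves e^{alpha a} = b / (b - a)."""
    a, b = k ** d, (2 * k - 1) ** d       # a = |S|, b = V(S) for S = {1,...,k}^d
    alpha_star = math.log(b / (b - a)) / a
    return z_bound(alpha_star, a, b)

def closed_form_z(k, d):
    """The optimized condition: (2k-1)^d z <= (1 - 1/t)^(t - 1), t = (2 - 1/k)^d."""
    t = (2 - 1 / k) ** d
    return (1 - 1 / t) ** (t - 1) / (2 * k - 1) ** d
```

For instance, `best_z(2, 3)` and `closed_form_z(2, 3)` agree to machine precision, and perturbing \(\alpha \) away from \(\alpha ^*\) only decreases the admissible z.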

Proof of Theorem 4.1

We apply Theorem 2.7 with \(a(D):= \alpha V(D)\). We check that \(V(\cdot )\) is strongly subadditive. Let \(B,C\) be finite subsets of \(\mathbb {Z}^d\). Then for every polymer X,

$$\begin{aligned} \mathbb {1}_{\{X\cap B\ne \varnothing \}} + \mathbb {1}_{\{X\cap C\ne \varnothing \}} \ge \mathbb {1}_{\{X\cap (B\cup C)\ne \varnothing \}} + \mathbb {1}_{\{X\cap (B\cap C) \ne \varnothing \}}. \end{aligned}$$

Indeed if X intersects B but not C (or C but not B), the inequality reads \(1+ 0 \ge 1 +0\) and it is true. If X intersects both B and C, the inequality reads \(1+1 \ge 1+ \mathbb {1}_{\{X\cap (B\cap C) \ne \varnothing \}}\) and it is true as well. Finally if X intersects neither B nor C, then both sides of the inequality vanish. Summing over all polymers X, we get

$$\begin{aligned} V(B) + V(C) \ge V(B\cup C) + V(B\cap C). \end{aligned}$$
(4.3)

Now we turn to the criterion (i) from Theorem 2.7. Condition (2.18) for \(D'= \varnothing \) reads

$$\begin{aligned} |S|\, z\, \mathrm {e}^{\alpha V(S)} \le \mathrm {e}^{\alpha V(\{x\})} - 1, \end{aligned}$$

it is satisfied because of \(V(\{x\}) = |S|\) and the assumption (4.2). For non-empty \(D'\), we bound the left-hand side of condition (2.18) with the help of the strong subadditivity. The inequality (4.3) applied to \(B = D'\cup \{x\}\) and \(C= X\) yields

$$\begin{aligned} V(D'\cup \{x\}) + V(X) \ge V(D'\cup X) + V(\{x\}), \end{aligned}$$
(4.4)

for \(x\in \mathbb {Z}^d\setminus D'\), \(x\in X\), and \(X\cap D' =\varnothing \), and

$$\begin{aligned} V(D'\cup X) - V(D')&\le V(D'\cup \{x\}) + V(X) - V(D') - V(\{x\})\\&= V(D'\cup \{x\}) - V(D') + V(S) - |S|. \end{aligned}$$

This provides an X-independent bound for the exponent in the left-hand side of condition (2.18). The number of summands on the left-hand side of condition (2.18) is given by the number of active polymers intersecting x but not \(D'\), which is equal to \(V(D'\cup \{x\}) - V(D')\). Thus to prove (2.18) it suffices to show that

$$\begin{aligned} \bigl (V(D'\cup \{x\}) - V(D')\bigr ) z\, \mathrm {e}^{\alpha [V(D'\cup \{x\}) - V(D') + V(S) - |S|]} \le \mathrm {e}^{\alpha [V(D'\cup \{x\})-V(D')]} - 1. \end{aligned}$$
(4.5)

In view of condition (4.2), the last inequality in turn follows once we check

$$\begin{aligned} \bigl (V(D'\cup \{x\}) - V(D')\bigr ) \bigl ( \mathrm {e}^{\alpha |S|} - 1\bigr ) \mathrm {e}^{\alpha [V(D'\cup \{x\}) - V(D') - |S|]} \le |S|\bigl ( \mathrm {e}^{\alpha [V(D'\cup \{x\})-V(D')]} - 1\bigr ) \end{aligned}$$

or equivalently,

$$\begin{aligned} \frac{1- \exp (-\alpha |S|)}{|S|} \le \frac{1 - \exp ( -\alpha R )}{R},\quad R:= V(D'\cup \{x\}) - V(D'). \end{aligned}$$

Because of the subadditivity of V, we have \(R \le V(\{x\}) = |S|\). The exponential map \(t\mapsto \exp (- \alpha t)\) is convex and therefore its difference quotient at the origin is monotone increasing, i.e., \((\exp (-\alpha x) - 1)/x \le (\exp ( -\alpha y) - 1)/y\) whenever \(0< x\le y\). We apply the inequality to \(x = R\) and \(y= |S|\) and obtain the required bound. \(\square \)
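The strong subadditivity (4.3) and the resulting bound \(V(D'\cup \{x\})-V(D')\le V(\{x\})=|S|\) can be confirmed by brute force for a concrete shape. The following sketch is ours (Python; the dimer shape, the window size and the test sets are our own choices):

```python
from itertools import product

S = [(0, 0), (1, 0)]   # the shape: a horizontal dimer in Z^2 (any finite shape works)

def V(D, radius=5):
    """Number of translates of S intersecting the finite set D.

    The window [-radius, radius]^2 must contain every translate of S that can
    hit D; for sets near the origin, the default radius suffices.
    """
    if not D:
        return 0
    count = 0
    for dx, dy in product(range(-radius, radius + 1), repeat=2):
        X = {(x + dx, y + dy) for (x, y) in S}
        if X & D:
            count += 1
    return count
```

One can then confirm, e.g., \(V(\{x\})=|S|=2\) and \(V(B)+V(C)\ge V(B\cup C)+V(B\cap C)\) on concrete sets near the origin.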

Single-Type Hard-Core System in \(\mathbb {R}^d\)

Consider a bounded convex shape \(S\subset \mathbb {R}^d\) which is non-empty, regular closed and balanced (recall: A set \(S\subset \mathbb {R}^d\) is called regular closed if and only if it equals the closure of its interior, i.e., \(\overline{S^\circ }=S\), and it is called balanced if and only if \(\alpha S\subset S\) for all \(\vert \alpha \vert \le 1\)). We investigate the special case of the hard-core setup in the continuum from Sect. 2.3 where \(\mathbb X\) consists of all translates \(x+ S = \{x+ y\mid y\in S\}\). Let us further assume that both the activity and the reference measure \(\lambda \) are translationally invariant. Then we may identify \(\mathbb X\) with \(\mathbb {R}^d\), the reference measure \(\lambda \) with the Lebesgue measure, and the activity function with a positive scalar, \(z(x) \equiv z>0\).

For an integrable function \(h: \mathbb X\rightarrow \mathbb {R}\) we write

$$\begin{aligned} \int _{\mathbb {X}} h(Z)\lambda (\mathrm {d}Z)=\int _{\mathbb {R}^d} h(x+S)\mathrm {d}x. \end{aligned}$$

Write |S| for the Lebesgue volume of the shape S and define for Borel sets \(D\subset \mathbb {R}^d\)

$$\begin{aligned} V(D):= \int _{\mathbb {R}^d} \mathbb {1}_{\{(x+S)\cap D\ne \varnothing \}} \mathrm {d}x. \end{aligned}$$
(4.6)

Notice \(V(\{y\}) = |S|\), which is positive and finite by our assumptions on S, and \(V(S) = |S\oplus S|\) with \(A\oplus B :=\{a+b\mid a\in A,\, b\in B\}\) the Minkowski sum. The latter identity holds since we assumed the set S to be balanced, which implies \(\{x\in \mathbb {R}^d\mid (x+S)\cap S\ne \varnothing \}=S\oplus S\). Moreover, notice that V—as a function on \(\mathbb {D}_{\varepsilon }\) with the convention \(V(\varnothing )=0\)—satisfies the measurability assumption from Theorem 2.6 by the same argument as formulated on p. 22 for \({\tilde{T}}\).

We refer to such systems as single-type hard-core systems in the continuum. In the language of stochastic geometry (see [28, Sects. 3,4]), the associated Gibbs measure is a hard-core germ–grain model with deterministic grain S (the germs are the positions x).

Theorem 4.2

Assume there exists \(\alpha >0\) such that

$$\begin{aligned} |S| \mathrm {e}^{\alpha V(S)} z <\mathrm {e}^{\alpha |S|}-1. \end{aligned}$$
(4.7)

Then the activity expansions \(\rho _n(x_1,\ldots ,x_n;z)\) converge absolutely for all \(n\in \mathbb {N}\) and all \((x_1,\ldots , x_n)\in \mathbb {X}^n\).

Remark 4.2

Again, notice that while Theorem 4.2—just as its discrete analogue Theorem 4.1—improves on the Kotecký–Preiss condition, in all the examples we considered it is weaker than the Fernández–Procacci criterion (e.g., for systems of hard spheres the bounds on the radius of convergence obtained in [9, 22] are slightly better). However, the advantage of our criterion is that it provides an explicit bound directly, with no need for numerical computation, regardless of the dimension.

Example 4.2

(Hard spheres) If \(S=B_R(0)\) is the closed ball of radius \(R>0\) around the origin, condition (4.7) becomes

$$\begin{aligned} z\le \sup _{\alpha > 0} \frac{\exp (\alpha |B_R(0)|)-1}{|B_R(0)|\, \exp ( \alpha |B_{2R}(0)|)}. \end{aligned}$$

Carrying out the optimization over \(\alpha \) yields the condition

$$\begin{aligned} |B_{2R}(0)|\, z \le \Bigl (1-\frac{1}{2^d}\Bigr )^{2^d -1}. \end{aligned}$$

In the limit \(d\rightarrow \infty \) at fixed \(R>0\), the right-hand side converges from above to the familiar bound \(1/\mathrm {e}\).
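The same numerical double-check applies here; the sketch below (Python, our names) computes the optimized value of \(\vert B_{2R}(0)\vert \,z\) from the actual ball volumes and confirms that it does not depend on R and decreases towards \(1/\mathrm {e}\) as \(d\rightarrow \infty \).

```python
import math

def ball_vol(d, R):
    """Lebesgue volume of the d-dimensional ball of radius R."""
    return math.pi ** (d / 2) / math.gamma(d / 2 + 1) * R ** d

def hard_sphere_bound(d, R):
    """|B_{2R}(0)| z at the optimal alpha, with a = |B_R(0)| and b = |B_{2R}(0)|."""
    a, b = ball_vol(d, R), ball_vol(d, 2 * R)
    # stationary point of alpha -> (e^{alpha a} - 1) / (a e^{alpha b})
    alpha_star = math.log(b / (b - a)) / a
    z_star = math.expm1(alpha_star * a) / (a * math.exp(alpha_star * b))
    return b * z_star
```

Numerically this reproduces the closed form \((1-2^{-d})^{2^d-1}\) for any radius R.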

Proof

Let \(\alpha >0\) satisfy condition (4.7). Set \(a({\widehat{D}}):= \alpha V({\widehat{D}})\) for \({\widehat{D}}\in \mathbb D_\varepsilon \), choose some chopping map C and let \(D\in \mathbb D_\varepsilon \) (the snippet-size \(\varepsilon >0\) will be specified later in the proof). For the simplest selection rule s choosing always the first snippet \(E_1\), condition (i) in Theorem 2.6 reads

$$\begin{aligned}&\sum _{k=1}^\infty \frac{z^k}{k!}\int _{\mathbb {X}^k} I(E_1;D';Y_1,\ldots , Y_k)\, \mathrm {e}^{\alpha [V(D'\cup (\cup _{i=1}^k Y_i))-V(D')]}\lambda ^k(\mathrm {d}\varvec{Y}) \nonumber \\&\quad \quad \qquad \le \mathrm {e}^{\alpha [V(D'\cup E_1)-V(D')]}-1, \end{aligned}$$
(4.8)

where \(D'=D\backslash E_1\).

Notice that—unlike in the discrete case—terms of order higher than one in z do not necessarily vanish in the series in (4.8). Inspired by the proof of Theorem 4.1, we first try to bound the exponent on the left-hand side of (4.8), seeking a bound that separates \(Y:= Y_1\cup \cdots \cup Y_k\) from \(E_1\) and \(D'\). If the constraint were that \(E_1\subset Y\), we would conclude with strong subadditivity applied to \(B:= Y\) and \(C:= D'\cup E_1\) that \(V(E_1)+ V(D'\cup Y) \le V(Y)+ V(D'\cup E_1)\). For the weaker constraint \(E_1\cap Y\ne \varnothing \), this is no longer true. Let \(Z\in \mathbb X\). A straightforward case distinction reveals that under the indicator I, the inequality

$$\begin{aligned} \mathbb {1}_{\{ Z\cap E_1\ne \varnothing \}} + \mathbb {1}_{\{ Z\cap (D'\cup Y) \ne \varnothing \}} \le \mathbb {1}_{\{ Z\cap Y\ne \varnothing \}} + \mathbb {1}_{\{ Z\cap (D'\cup E_1)\ne \varnothing \}} \end{aligned}$$

is correct for all possible values of the left-hand side except possibly \(1+1\). Indeed it may happen that Z intersects both \(E_1\) and \(D'\cup Y\), hence a fortiori \(D'\cup E_1\), but not Y, so that the right-hand side becomes \(0+1\). This happens precisely when Z intersects \(D'\) and \(E_1\) but not Y. The inequality becomes correct if we add the indicator of this event to the right-hand side. Integrating over Z, we obtain

$$\begin{aligned} V(E_1)+ V(D'\cup Y) \le V(Y)+ V(D'\cup E_1) + \int _\mathbb X \mathbb {1}_{\{Z\cap E_1\ne \varnothing ,\, Z\cap Y=\varnothing ,\, Z\cap D'\ne \varnothing \}} \lambda (\mathrm {d}Z). \end{aligned}$$

Moreover, there exists a constant \(C=C(S,d)>0\) that depends only on the dimension d and the shape S such that if \(k=1\) and \(Y=Y_1\in \mathbb X\) is a translate of S, then

$$\begin{aligned} \int _\mathbb X \mathbb {1}_{\{Z\cap E_1\ne \varnothing ,\, Z\cap Y=\varnothing ,\, Z\cap D'\ne \varnothing \}} \lambda (\mathrm {d}Z)\le C\varepsilon . \end{aligned}$$

Indeed, on the left-hand side we may drop the indicator that Z intersects \(D'\), so it is sufficient to check

$$\begin{aligned} \int _\mathbb X \mathbb {1}_{\{Z\cap E_1\ne \varnothing ,\, Z\cap Y=\varnothing \}} \lambda (\mathrm {d}Z)\le C\varepsilon . \end{aligned}$$
(4.9)

To see that such an estimate holds, let \(B_\varepsilon (c)\) be a closed ball of radius \(\varepsilon \) around some \(c\in \mathbb {R}^d\) containing the snippet \(E_1\), and let \(x\in Y\cap E_1\) (by assumption this intersection is non-empty). Then \(x\in B_\varepsilon (c)\) and the inequality

$$\begin{aligned} \mathbb {1}_{\{Z\cap E_1\ne \varnothing ,\, Z\cap Y=\varnothing \}}\le \mathbb {1}_{\{Z\cap B_\varepsilon (c)\ne \varnothing ,\, x\notin Z\}} \end{aligned}$$

holds pointwise in Z, thus also

$$\begin{aligned} \int _\mathbb X \mathbb {1}_{\{Z\cap E_1\ne \varnothing ,\, Z\cap Y=\varnothing \}}\lambda (\mathrm {d}Z)\le \int _\mathbb X \mathbb {1}_{\{Z\cap B_\varepsilon (c)\ne \varnothing ,\, x\notin Z\}}\lambda (\mathrm {d}Z). \end{aligned}$$

Notice that

$$\begin{aligned} \int _\mathbb X \mathbb {1}_{\{Z\cap B_\varepsilon (c)\ne \varnothing ,\, x\notin Z\}}\lambda (\mathrm {d}Z)=\vert \{y\in \mathbb {R}^d\vert (y+S)\cap B_\varepsilon (0)\ne \varnothing , \tilde{x}\notin y+S\}\vert , \end{aligned}$$

where \(B_\varepsilon (0)\) is the closed ball of radius \(\varepsilon \) around 0 and \(\tilde{x}:=x-c\). For the set on the right-hand side of that equation, the identity

$$\begin{aligned} \{y\in \mathbb {R}^d\vert (y+S)\cap B_\varepsilon (0)\ne \varnothing , \tilde{x}\notin y+S\}=\left( S\oplus B_\varepsilon (0)\right) \backslash (\tilde{x}+S), \end{aligned}$$

holds since S being balanced directly implies

$$\begin{aligned}\{y\in \mathbb {R}^d\vert (y+S)\cap B_\varepsilon (0)\ne \varnothing \}=S\oplus B_\varepsilon (0) \end{aligned}$$

and

$$\begin{aligned}\{y\in \mathbb {R}^d\vert \tilde{x}\in y+S\}=\tilde{x}+S. \end{aligned}$$

Furthermore, observe that the inclusion

$$\begin{aligned} \left( S\oplus B_\varepsilon (0)\right) \backslash (\tilde{x}+S)\subset \left( \left( S\oplus B_\varepsilon (0)\right) \backslash S\right) \cup \left( \left( S\oplus B_\varepsilon (\tilde{x})\right) \backslash (\tilde{x}+S)\right) \end{aligned}$$

holds since \(\tilde{x}\in B_\varepsilon (0)\) and S is a balanced set. Moreover, \(\left( S\oplus B_\varepsilon (\tilde{x})\right) \backslash (\tilde{x}+S)\) is the translate of \(\left( S\oplus B_\varepsilon (0)\right) \backslash S\) by \(\tilde{x}\), hence it has the same Lebesgue volume and

$$\begin{aligned} \vert \left( S\oplus B_\varepsilon (0)\right) \backslash (\tilde{x}+S)\vert \le&\vert \left( S\oplus B_\varepsilon (0)\right) \backslash S\vert + \vert \left( S\oplus B_\varepsilon (\tilde{x})\right) \backslash (\tilde{x}+S)\vert \\=&2\vert \left( S\oplus B_\varepsilon (0)\right) \backslash S\vert \\=&2\left( \vert S\oplus B_\varepsilon (0)\vert -\vert S\vert \right) . \end{aligned}$$

Finally, by Steiner’s formula for compact convex sets (see [28]), \(\vert S\oplus B_\varepsilon (0)\vert -\vert S\vert \) is given by a polynomial in \(\varepsilon \) with vanishing constant term, which for small \(\varepsilon \) yields a bound of the form given by the right-hand side of (4.9) (where the constant \(C>0\) can be expressed in terms of the intrinsic volumes of S appearing in the formula).
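The Steiner-type bound can be illustrated numerically (a sketch in the plane with S the unit square, which is convex and, after centering, balanced; the constant \(4+\pi \) is specific to this hypothetical choice):

```python
import math

# Steiner's formula in the plane for the unit square S (area 1, perimeter 4):
# |S + B_eps| = |S| + perimeter * eps + pi * eps^2, so the excess volume
# |S + B_eps| - |S| = 4*eps + pi*eps^2 is a polynomial in eps without
# constant term, hence bounded by C*eps with C = 4 + pi whenever eps <= 1.
def excess(eps):
    return 4.0 * eps + math.pi * eps * eps

C = 4.0 + math.pi
for eps in (0.001, 0.01, 0.1, 0.5, 1.0):
    assert excess(eps) <= C * eps
```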

Consequently, we obtain the bound

$$\begin{aligned} V(E_1)+ V(D'\cup Y) \le V(Y)+ V(D'\cup E_1) + C\varepsilon \end{aligned}$$
(4.10)

which corresponds to the bound (4.4) in the proof of Theorem 4.1.

The inequality (4.10) immediately yields the following upper bound for the left-hand side of (4.8):

$$\begin{aligned} \mathrm {e}^{\alpha C \varepsilon }\, \mathrm {e}^{\alpha [V(D'\cup E_1)-V(D')- V(E_1)]} \sum _{k=1}^\infty \frac{z^k}{k!}\int _{\mathbb {X}^k} I(E_1;D';Y_1,\ldots , Y_k)\, \mathrm {e}^{\alpha V(Y)}\lambda ^k(\mathrm {d}\varvec{Y}). \end{aligned}$$

The summand for \(k=1\) is equal to

$$\begin{aligned} z\, \mathrm {e}^{\alpha V(S)}\int _\mathbb X\mathbb {1}_{\{Y_1\cap E_1\ne \varnothing , Y_1\cap D'= \varnothing \}}\lambda (\mathrm {d}Y_1)=z[V(D'\cup E_1)-V(D')]\mathrm {e}^{\alpha V(S)}. \end{aligned}$$
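This identity can be sanity-checked in a one-dimensional toy configuration (a sketch; the grain S and the sets \(D'\), \(E_1\) below are hypothetical choices made for illustration):

```python
# Toy check of the identity in d = 1 with grain S = [-1/2, 1/2].
# For an interval D = [a, b], V(D) = |{x : (x + S) meets D}| = (b - a) + 1.
def V(a, b):
    return (b - a) + 1.0

b1 = 1.0            # D' = [0, 1]
a2, b2 = 1.0, 1.1   # snippet E_1 = [1, 1.1], adjacent to D'

# x + S meets E_1 iff x is in [a2 - 1/2, b2 + 1/2]; it misses D' iff
# x > b1 + 1/2, so the translates meeting E_1 but not D' form the
# interval (b1 + 1/2, b2 + 1/2].
lhs = (b2 + 0.5) - (b1 + 0.5)
rhs = V(0.0, b2) - V(0.0, b1)   # V(D' u E_1) - V(D')
assert abs(lhs - rhs) < 1e-12
```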

For \(k\ge 2\), we bound \(V(Y) \le \sum _{i=1}^k V(Y_i) = k V(S)\), drop the indicator that the \(Y_i\)’s do not intersect \(D'\), and get the upper bound

$$\begin{aligned} z^k \mathrm {e}^{\alpha k V(S)} \int _{\mathbb X^k}\prod _{i=1}^k \mathbb {1}_{\{Y_i\cap E_1 \ne \varnothing \}}\mathbb {1}_{\{Y_1,\ldots , Y_k\text { disjoint} \}}\lambda ^k (\mathrm {d}\varvec{Y}). \end{aligned}$$

Notice that there exists \(N\in \mathbb {N}\) such that for all \(k\ge N+1\) the integral vanishes. To see this, assume that there are infinitely many disjoint objects \(Y\in \mathbb X\) intersecting the snippet \(E_1\) (and therefore some open \(\varepsilon \)-ball \(B_\varepsilon \) in which the snippet is contained). Since all the objects Y are translates of S, there is a radius \(r>0\), the same for all of them, such that every such \(Y=x+S\) satisfies \(Y \subset B_r(x)\). Naturally, every such r-ball must intersect \(B_\varepsilon \) and therefore their union is again a bounded Borel subset of \(\mathbb {R}^d\). But, by our assumptions on the shape S, every \(Y\in \mathbb X\) has the same fixed, strictly positive Lebesgue measure, so the disjoint union of infinitely many such objects must have infinite Lebesgue measure, which contradicts its boundedness.
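The finiteness of N can also be made quantitative by a volume comparison (a sketch for the hypothetical case of unit disks in the plane): every disjoint translate of S meeting \(B_\varepsilon \) lies in the ball of radius \(\varepsilon +2r\) around the center of \(B_\varepsilon \), so their number is at most \(\vert B_{\varepsilon +2r}\vert /\vert S\vert \).

```python
import math

# Packing bound: disjoint disks of radius r that meet B_eps(0) have centers
# within distance eps + r of the origin, hence are contained in B_{eps+2r}(0);
# comparing areas bounds their number.
def max_disjoint(eps, r):
    return math.floor(math.pi * (eps + 2.0 * r) ** 2 / (math.pi * r * r))

N = max_disjoint(0.1, 1.0)   # unit disks near a ball of radius 0.1
assert N == 4
```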

For \(k \le N\), we drop the indicator that \(Y_3,\ldots , Y_k\) are disjoint and find that the integral is bounded by

$$\begin{aligned} V(E_1)^{k-2} \int _{\mathbb X}\mathbb {1}_{\{Y_1\cap E_1 \ne \varnothing \}} \Bigl ( \int _\mathbb X \mathbb {1}_{\{Y_2\cap E_1 \ne \varnothing ,\, Y_2 \cap Y_1 = \varnothing \}}\lambda (\mathrm {d}Y_2) \Bigr ) \lambda (\mathrm {d}Y_1). \end{aligned}$$

The inner integral is bounded by \(C\varepsilon \) because of (4.9), the outer integral gives an additional factor \(V(E_1)\). Altogether, the left-hand side of (4.8) is bounded by

$$\begin{aligned}&\mathrm {e}^{\alpha C \varepsilon }\, \mathrm {e}^{\alpha [V(D'\cup E_1)-V(D')- V(E_1)]}\Bigl ( z[V(D'\cup E_1)-V(D')]\mathrm {e}^{\alpha V(S)}\\&\quad + C\varepsilon \sum _{k=2}^N z^k V(E_1)^{k-1} \mathrm {e}^{\alpha k {V(S)}} \Bigr ). \end{aligned}$$

Proceeding as in the proof of Theorem 4.1, but taking into account the strict inequality from assumption (4.7), we find that there exists \(\alpha >0\) such that

$$\begin{aligned} \mathrm {e}^{\alpha [V(D'\cup E_1)-V(D')- V(E_1)]} z[V(D'\cup E_1)-V(D')]\mathrm {e}^{\alpha V(S)} <\mathrm {e}^{\alpha [V(D'\cup E_1) - V(D')] }- 1, \end{aligned}$$

compare to (4.5). Therefore, picking \(\varepsilon \) small enough, we see that (4.8), hence also condition (i) in Theorem 2.6, is satisfied and all \(T(D;z)\), \(D\in \mathbb D_\varepsilon \), are absolutely convergent. The claim of the theorem follows immediately. \(\square \)

Multi-Type Hard Spheres in \(\mathbb {R}^d\)

Let \((r_n)_{n\in \mathbb {N}}\) be an increasing sequence of positive real numbers and let \(B_{r_n}(0)\subset \mathbb {R}^d\), \(n\in \mathbb {N}\), be the family of d-dimensional closed balls around 0 with the corresponding radii. To the sequence \((r_n)_{n\in \mathbb {N}}\) of radii we associate a sequence of non-negative activities \((z_n)_{n\in \mathbb {N}}\), the ball \(B_{r_n}\) carrying the activity \(z_n\). In the setup of hard-core systems in the continuum from Sect. 2.3, let \(\mathbb X\) be given by all possible translates of these objects. Notice that the closed balls are compact convex sets that are non-empty and regular closed. We refer to this special case of a hard-core system as a system of multi-type hard spheres in \(\mathbb {R}^d\). We will show a new sufficient condition for absolute convergence of the activity expansions in these types of models.

For an integrable function \(h: \mathbb X\rightarrow \mathbb {R}\) we write

$$\begin{aligned} \int _{\mathbb {X}} h(Z)\lambda (\mathrm {d}Z)=\sum _{\ell \ge 1}\ \int _{\mathbb {R}^d} h(x+B_{r_\ell }(0))\mathrm {d}x. \end{aligned}$$

We define the family of functions \((V_r)_{r>0}\) by setting for Borel sets \(D\subset \mathbb {R}^d\)

$$\begin{aligned} V_r(D):=\int _{\mathbb R^d}\mathbb {1}_{\{(x+B_r(0))\cap D\ne \varnothing \}} \mathrm {d}x, \end{aligned}$$
(4.11)

where \(B_r(0)\) is the d-dimensional closed ball of radius \(r>0\) around 0. Naturally, the map \(V_r\) coincides with the map V from (4.6) for the grain S given by the closed ball \(B_r(0)\) and therefore satisfies the measurability assumption from Theorem 2.6 (as a function on \(\mathbb {D}_{\varepsilon }\) with the convention \(V(\varnothing )=0\)). Furthermore, we have \(V_r(\{y\})=\vert B_r(0)\vert \) and \(V_r(B_s(y))=\vert B_s(y)\oplus B_r(0)\vert =\vert B_{s+r}(0)\vert \) (where \(\oplus \) denotes the Minkowski sum) for any \(y\in \mathbb {R}^d\) and any numbers \(r,s>0\).

The following auxiliary result turns out to be essential for the proof of the new sufficient condition:

Lemma 4.3

Let \(D_1\) be a finite union of bounded convex regular closed subsets of \(\mathbb {R}^d\) and let \(D_2\) be a d-dimensional ball in \(\mathbb {R}^d\). The map \((0,\infty )\ni r\mapsto \frac{V_r(D_1 \cup D_2)-V_r(D_1)}{V_r(D_2)}\) is monotonically decreasing in r.

Proof

First of all, observe that for sets D given by a finite union of convex, regular closed subsets of \(\mathbb {R}^d\) the volume \(V_r(D)\) can be written as

$$\begin{aligned} V_r(D)=\vert D\vert + S(D)r+o(r), \end{aligned}$$
(4.12)

where S(D) denotes the surface area of D. This follows from a generalized version of the classical Steiner formula (see [27, Sect. 4.4]). In particular, we see that the map \(r\mapsto \frac{V_r(D_1 \cup D_2)-V_r(D_1)}{V_r(D_2)}\) is differentiable at \(r=0\).

Next, we notice that the family \((V_r)_{r>0}\) satisfies the following semi-group property:

$$\begin{aligned} V_{r+\varepsilon }(D)=V_\varepsilon ( D\oplus B_r(0)). \end{aligned}$$

Therefore, to prove the claim of the lemma, it suffices to consider the derivative at zero:

$$\begin{aligned} \lim \limits _{\varepsilon \searrow 0}\frac{1}{\varepsilon }\left( \frac{V_\varepsilon (A\cup B)-V_\varepsilon (A)}{V_\varepsilon (B)}-\frac{\vert A\cup B\vert -\vert A\vert }{\vert B \vert }\right) , \end{aligned}$$

where \(A:=D_1\oplus B_r(0)\) and \(B:=D_2\oplus B_r(0)\).

Using the formula (4.12), a simple computation shows that this limit is equal to

$$\begin{aligned} \frac{\vert B \vert \left( S(A\cup B)-S(A)\right) -S(B)\left( \vert A\cup B\vert -\vert A\vert \right) }{\vert B \vert ^2}. \end{aligned}$$

The monotonicity in the claim of the lemma is then equivalent to

$$\begin{aligned} \vert B \vert \left( S(A\cup B)-S(A)\right) -S(B)\left( \vert A\cup B\vert -\vert A\vert \right) \le 0 \end{aligned}$$

or, equivalently,

$$\begin{aligned}\frac{\vert B\vert }{S(B)} \le \frac{\vert A\cup B\vert -\vert A\vert }{S(A\cup B)-S(A)}. \end{aligned}$$

Using the obvious identities \(\vert A\cup B\vert -\vert A\vert = \vert B\vert - \vert A\cap B\vert \) and \(S(A\cup B)-S(A)=S(B)-S(A\cap B)\), we can rewrite the last inequality as

$$\begin{aligned}\frac{S(B)}{\vert B \vert }\le \frac{ S(A\cap B)}{\vert A\cap B\vert }, \end{aligned}$$

which holds by the isoperimetric inequality since \(B=D_2\oplus B_r(0)\) is a ball in \(\mathbb {R}^d\) (“the ball is the shape that minimizes the surface area for given volume”, see [7, 3.2.43]). \(\square \)
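The monotonicity of Lemma 4.3 can be checked numerically in a simple planar configuration (a sketch with two hypothetical overlapping disks; \(V_r\) of a disk of radius s is \(\pi (s+r)^2\), and \(V_r\) of the union is the exact two-circle union area):

```python
import math

def lens(R1, R2, d):
    # area of the intersection of two disks (assumes |R1 - R2| < d < R1 + R2)
    a1 = math.acos((d*d + R1*R1 - R2*R2) / (2.0*d*R1))
    a2 = math.acos((d*d + R2*R2 - R1*R1) / (2.0*d*R2))
    tri = 0.5 * math.sqrt((-d+R1+R2)*(d+R1-R2)*(d-R1+R2)*(d+R1+R2))
    return R1*R1*a1 + R2*R2*a2 - tri

def ratio(r, s1=1.0, s2=0.5, d=1.2):
    # (V_r(D1 u D2) - V_r(D1)) / V_r(D2) for disks D1, D2 of radii s1, s2
    # whose centers are a distance d apart
    R1, R2 = s1 + r, s2 + r
    union = math.pi*R1*R1 + math.pi*R2*R2 - lens(R1, R2, d)
    return (union - math.pi*R1*R1) / (math.pi*R2*R2)

vals = [ratio(0.01 * k) for k in range(1, 200)]
assert all(b <= a + 1e-9 for a, b in zip(vals, vals[1:]))  # non-increasing
```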

The following sufficient condition is, in some sense, a “continuous version” of the Gruber–Kunz criterion in the setup of hard spheres in \(\mathbb {R}^d\). The similarity in the form arises as follows: To establish the recurrence relations underlying the proof of Gruber–Kunz we selected a monomer, a single point in \(\mathbb {Z}^d\), from a configuration of polymers. We follow this idea in the proof of the following result, choosing the chopping map C and the selection rule s such that a tiny snippet that approximates a single point in the continuous space sufficiently well is selected. At the same time we choose an ansatz function a that can be interpreted as the continuous analogue of the ansatz function from the proof of Corollary 2.8.

Theorem 4.4

In the setup of multi-type hard spheres in \(\mathbb {R}^d\), assume that the activity z satisfies

$$\begin{aligned} \exists \alpha >0:~~\sum \limits _{\ell \ge 1}\vert B_{r_\ell }\vert \mathrm {e}^{\alpha \vert B_{r_\ell +r_1}\vert }z_\ell < \mathrm {e}^{\alpha \vert B_{r_1}\vert }-1, \end{aligned}$$
(4.13)

where by \(\vert B_r\vert \) we denote the (Lebesgue) volume of a ball of radius \(r>0\). Then the activity expansions \(\rho _n(X_1,\ldots , X_n;z)\) converge absolutely.
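Whether (4.13) holds for a given system can be checked by scanning over \(\alpha \) (a numerical sketch; the two-type planar system below, with radii 1 and 2 and activities 0.01 and 0.002, is a made-up example):

```python
import math

radii = [1.0, 2.0]            # hypothetical two-type system in d = 2
acts  = [0.01, 0.002]
vol = lambda r: math.pi * r * r   # |B_r| in the plane
r1 = radii[0]

def gap(alpha):
    # left-hand side minus right-hand side of (4.13); negative means satisfied
    lhs = sum(vol(r) * math.exp(alpha * vol(r + r1)) * z
              for r, z in zip(radii, acts))
    return lhs - (math.exp(alpha * vol(r1)) - 1.0)

# condition (4.13) is satisfied for some alpha (e.g. near 0.05)
assert min(gap(0.001 * k) for k in range(1, 200)) < 0.0
```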

Remark 4.3

Activity expansions for systems with infinitely many types of objects are not particularly well-studied in statistical mechanics. In the case of finitely many types, we expect our result to improve on Kotecký–Preiss but to be weaker than Fernández–Procacci—as in the special case of a single type treated above (see Remark 4.2). The general case, for \(r_n\rightarrow \infty \) in particular, remains to be investigated.

Proof

Again, our strategy is to show that condition (i) from Theorem 2.6 is satisfied for an appropriate ansatz function a. By assumption, \(r_1\) is the radius of the smallest ball present in the system. Set \(a({\widehat{D}}):=\alpha V_{r_1}({\widehat{D}})\) for \(\widehat{D}\in \mathbb D_\varepsilon \), where \(V_{r_1}\) is given by (4.11) and \(\alpha \) satisfies (4.13). Choose some chopping map C and let \(D\in \mathbb D_\varepsilon \) (the snippet-size \(\varepsilon >0\) is to be specified later in the proof). Just as in the proof of Theorem 4.2, independently of the choice of the snippet \(E_1\) (i.e., independently of the selection rule s), we obtain the following upper bound for the left-hand side of the inequality from condition (i):

$$\begin{aligned} \mathrm {e}^{\alpha C_1 (\varepsilon )}\, \mathrm {e}^{\alpha [V(D'\cup E_1)-V(D')- V(E_1)]} \sum _{k=1}^\infty \frac{z^k}{k!}\int _{\mathbb {X}^k} I(E_1;D';Y_1,\ldots , Y_k)\, \mathrm {e}^{\alpha V(\varvec{Y})}\lambda ^k(\mathrm {d}\varvec{Y}). \end{aligned}$$
(4.14)

The positive number \(C_1(\varepsilon )\) converges towards 0 for \(\varepsilon \searrow 0\) and is precisely the bound from (4.9) in the proof of Theorem 4.2 for Y given by a translate of \(B_{r_1}(0)\), i.e., by a sphere of minimal volume present in the system.

The summand for \(k=1\) in (4.14) is equal to

$$\begin{aligned} \sum \limits _{\ell \ge 1}z_\ell \mathrm {e}^{\alpha \vert B_{r_\ell +r_1}\vert }\int _{\mathbb X}\mathbb {1}_{\{r(Y_1) = r_\ell \}}\mathbb {1}_{\{Y_1\cap E_1\ne \varnothing , Y_1\cap D'= \varnothing \}}\lambda (\mathrm {d}Y_1). \end{aligned}$$

Notice that the integrals in the last expression are equal to \([V_{r_\ell }(D'\cup E_1)-V_{r_\ell }(D')]\) for every \(\ell \in \mathbb N\).

The summand for any \(k\ge 2\) in (4.14) is bounded from above by

$$\begin{aligned}&\sum \limits _{\ell _1,\ldots ,\ell _k}z_{\ell _1}\ldots z_{\ell _k}\mathrm {e}^{\alpha \sum \limits _{i=1}^k \vert B_{r_{\ell _i}+r_1}\vert }\int _{\mathbb X^k}\prod \limits _{i=1}^k\mathbb {1}_{\{r(Y_i) = r_{\ell _i} \}}\nonumber \\&\quad \prod \limits _{i=1}^k\mathbb {1}_{\{Y_i\cap E_1\ne \varnothing , Y_i\cap D'= \varnothing \}}\mathbb {1}_{\{Y_1,\ldots ,Y_k \textit{ disjoint}\}}\lambda ^k(\mathrm {d}\varvec{Y}), \end{aligned}$$
(4.15)

which—by arguments similar to the ones used for the bound \(C_1(\varepsilon )\)—is again bounded by

$$\begin{aligned} C_2(\varepsilon )\sum \limits _{\ell _1,\ldots ,\ell _k}z_{\ell _1}\ldots z_{\ell _k}\mathrm {e}^{\alpha \sum \limits _{i=1}^k \vert B_{r_{\ell _i}+r_1}\vert }\vert B_{r_{\ell _1}}\vert \ldots \vert B_{r_{\ell _k}}\vert \end{aligned}$$
(4.16)

for a positive constant \(C_2(\varepsilon )\) that is independent of k and satisfies \(C_2(\varepsilon )\searrow 0\) for \(\varepsilon \searrow 0\). Notice that the sum in (4.16) is finite for every \(k\in \mathbb {N}\) by assumption (4.13) [since it is simply given by the k-th power of the left-hand side of the inequality in (4.13)]. Moreover, by the same argument as in the proof of Theorem 4.2, the expression in (4.15) vanishes for all but finitely many \(k\in \mathbb {N}\), i.e., there exists a number \(N\in \mathbb {N}\) such that (4.15) is equal to zero for all \(k\ge N+1\).

Altogether we get the upper bound

$$\begin{aligned}&\mathrm {e}^{\alpha C_1 (\varepsilon )}\, \mathrm {e}^{\alpha [V(D'\cup E_1)-V(D')- V(E_1)]}\times \Bigl ( \sum \limits _{\ell \ge 1}z_\ell \mathrm {e}^{\alpha \vert B_{r_\ell +r_1}\vert }[V_{r_\ell }(D'\cup E_1)-V_{r_\ell }(D')]+\\ {}&C_2(\varepsilon )\sum \limits _{2\le k\le N}\sum \limits _{\ell _1,\ldots ,\ell _k}z_{\ell _1}\ldots z_{\ell _k}\mathrm {e}^{\alpha \sum \limits _{i=1}^k \vert B_{r_{\ell _i}+r_1}\vert }\vert B_{r_{\ell _1}}\vert \ldots \vert B_{r_{\ell _k}}\vert \Bigr ). \end{aligned}$$

As in the single-type case, we see that it is sufficient to prove the strict inequality

$$\begin{aligned} \mathrm {e}^{\alpha [V(D'\cup E_1)-V(D')- V(E_1)]}\sum \limits _{\ell \ge 1}z_\ell \mathrm {e}^{\alpha \vert B_{r_\ell +r_1}\vert }[V_{r_\ell }(D'\cup E_1)-V_{r_\ell }(D')]< \mathrm {e}^{\alpha [V(D'\cup E_1)-V(D')]}-1 \end{aligned}$$

for small values of \(\varepsilon >0\) and, consequently, for small volumes of the snippet \(E_1\) contained in an \(\varepsilon \)-ball.

To do so, we bound \([V_{r_\ell }(D'\cup E_1)-V_{r_\ell }(D')]\) from above by \([V_{r_\ell }(D'\cup B_\varepsilon )-V_{r_\ell }(D')]\) for every \(\ell \in \mathbb {N}\), where \(B_\varepsilon \) is the ball of radius \(\varepsilon \) containing the snippet \(E_1\). Then we use Lemma 4.3 to obtain

$$\begin{aligned}{}[V_{r_\ell }(D'\cup B_\varepsilon )-V_{r_\ell }(D')]\le [V_{r_m}(D'\cup B_\varepsilon )-V_{r_m}(D')]\frac{V_{r_\ell }(B_\varepsilon )}{V_{r_m}(B_\varepsilon )} \end{aligned}$$

for \(m\le \ell \) (since in that case \(r_m\le r_\ell \) holds by assumption) and therefore

$$\begin{aligned} \sum \limits _{\ell \ge 1}z_\ell \mathrm {e}^{\alpha \vert B_{r_\ell +r_1}\vert }[V_{r_\ell }(D'\cup E_1)-V_{r_\ell }(D')]\le \frac{V_{r_1}(D'\cup B_\varepsilon )-V_{r_1}(D')}{V_{r_1}(B_\varepsilon )}\sum \limits _{\ell \ge 1}z_\ell \mathrm {e}^{\alpha \vert B_{r_\ell +r_1}\vert }V_{r_\ell }(B_\varepsilon ). \end{aligned}$$

By dominated convergence and assumption (4.13) we can choose \(\varepsilon >0\) small enough to strictly bound the right-hand side of the last inequality by

$$\begin{aligned} \frac{V_{r_1}(D'\cup E_1)-V_{r_1}(D')}{V_{r_1}(E_1)}(\mathrm {e}^{\alpha V_{r_1}(E_1)}-1). \end{aligned}$$

Finally, it suffices to show the inequality

$$\begin{aligned} \mathrm {e}^{\alpha [V(D'\cup E_1)-V(D')- V(E_1)]}\frac{V_{r_1}(D'\cup E_1)-V_{r_1}(D')}{V_{r_1}(E_1)}(\mathrm {e}^{\alpha V_{r_1}(E_1)}-1)\le \mathrm {e}^{\alpha [V(D'\cup E_1)-V(D')]}-1 \end{aligned}$$
(4.17)

as in (4.5) to conclude the proof. \(\square \)

Tonks Gas on \(\mathbb {Z}\)

Next we turn to the discrete one-dimensional Tonks gas with translationally invariant activities. That is, in the setup of subset polymers from Sect. 2.4 for \(d=1\), let \((z_\ell )_{\ell \in \mathbb {N}}\) be a sequence of non-negative numbers and consider the activity

$$\begin{aligned} z(X) = {\left\{ \begin{array}{ll} z_\ell , &{}\quad X = \{m, m+1,\ldots , m+\ell -1\}\text { for some }m \in \mathbb {Z},\\ 0, &{}\quad \text {else}. \end{array}\right. } \end{aligned}$$
(4.18)

Theorem 4.5

Let \(d=1\) and let \((z_\ell )_{\ell \in \mathbb {N}}\) be a sequence of non-negative activities.

  (a)

    Suppose there exists \(\alpha >0\) such that

    $$\begin{aligned} \sum _{\ell =1}^\infty \mathrm {e}^{\alpha \ell }z_\ell \le \mathrm {e}^{\alpha }-1. \end{aligned}$$
    (4.19)

    Then \(T(D;z)\) is absolutely convergent for all finite subsets \(D\subset \mathbb {Z}\).

  (b)

    Conversely, if \(T(D;z)\) is absolutely convergent for all finite subsets \(D\subset \mathbb {Z}\), then there exists \(\alpha >0\) such that (4.19) holds true.

Remark 4.4

The condition (4.19) is exactly the necessary and sufficient criterion for absolute convergence of the activity expansion of the pressure in the system derived in [13]. While the result itself is not novel, we consider the proof to be instructive since it demonstrates how our approach can provide conditions improving on the Fernández–Procacci criterion. In this concrete setup even the optimal result—recovering the whole domain of convergence—can be achieved.

The proof of the sufficient condition relies on a refinement of Theorem 2.7. Roughly, we weaken condition (i) to consider the Kirkwood–Salsburg inequalities being satisfied only for single rods rather than for arbitrary configurations of rods; at the same time we specify the selection rule by assuming that the leftmost (or, alternatively, the rightmost) element \(\{x\}\) is always picked from any given domain.

Proposition 4.6

Suppose there exists a non-negative function \(a(\cdot )\) from the finite intervals of \(\mathbb {Z}\) to \([0,\infty )\) with \(a(\varnothing ) =0\) such that for every finite interval D of \(\mathbb Z\) with \(x=\min D\)

$$\begin{aligned} \sum _{\begin{array}{c} Y\ni x,\\ Y\cap D' =\varnothing \end{array}} z(Y)\, \mathrm {e}^{a(D'\cup Y) - a(D')} \le \mathrm {e}^{a(D'\cup \{x\}) - a(D')} - 1, \end{aligned}$$
(4.20)

where we set \(D'=D\backslash \{x\}\). Then \(T(D;z)\) is absolutely convergent for all finite \(D\subset \mathbb {Z}\) (interval or not).

Proof

We revisit the proof of the implication (i)\(\ \Rightarrow \ \)(ii) of Theorem 2.7 given on p. 24 and prove first by induction on N that \({\tilde{T}}_N(D;z)\le \exp (a(D))\), for all finite discrete intervals \(D\subset \mathbb {Z}\). For \(N=1\), the inequality is trivial because \({\tilde{T}}_1(D; z)=\mathbb {1}_{\{\vert D\vert \le 1\}}\le 1\le \exp (a(D))\). Now, suppose \({\tilde{T}}_N(D;z)\le \exp (a(D))\) for some \(N\in \mathbb {N}\) and all discrete intervals \(D\subset \mathbb {Z}\). Let \(\widehat{D}\subset \mathbb {Z}\) be any discrete interval. If \({\widehat{D}}=\varnothing \), then \({\tilde{T}}_{N+1}({\widehat{D}};z) =1\le \exp (a({\widehat{D}}))\). If \({\widehat{D}}\) is non-empty, let \(x:=\min {\widehat{D}}\) (or, alternatively, \(x:=\max {\widehat{D}}\)) and set \(D'=\widehat{D}\backslash \{x\}\); then Proposition 3.12 yields

$$\begin{aligned} {\tilde{T}}_{N+1}({\widehat{D}};z) = {\tilde{T}}_N(D';z) + \sum _{\begin{array}{c} Y\ni x:\\ Y\cap D'=\varnothing \end{array}} z(Y) \tilde{T}_N(D'\cup Y;z). \end{aligned}$$

Since all the arguments \(D'\) and \(D'\cup Y\) of \({\tilde{T}}_N\) on the right side of this identity are again finite discrete intervals, the inductive hypothesis and our assumption (4.20) imply that

$$\begin{aligned} {\tilde{T}}_{N+1}({\widehat{D}};z)\le \mathrm {e}^{a(D')} + \sum _{\begin{array}{c} Y\ni x:\\ Y\cap D'=\varnothing \end{array}} z(Y) \mathrm {e}^{a(D'\cup Y)} \le \mathrm {e}^{ a({\widehat{D}})}. \end{aligned}$$

This completes the inductive proof of the inequality \(\tilde{T}_N(D;z) \le \exp ( a(D))\). Passing to the limit \(N\rightarrow \infty \), we get \({\tilde{T}}(D;z) \le \exp (a(D))<\infty \) for all intervals \(D\subset \mathbb {Z}\). The convergence extends to all finite sets because \(\log {\tilde{T}}(\cdot ;z)\) is subadditive. \(\square \)

Proof of Theorem 4.5(a)

Consider the selection rule \(s(D):= \min D\) that picks the left-most point of a finite set. For \(\alpha >0\) and \(L \in \mathbb {N}\) let

$$\begin{aligned} V_L(D):= \bigl |\{ X\subset \mathbb {Z}\mid X\text { is an }L\text {-rod, } X\cap D \ne \varnothing \}\bigr | \end{aligned}$$

and \(a(D)\equiv a_{\alpha ,L}(D):= \alpha \, V_L(D)\). The choice of \(\alpha \) and L is specified later. For a non-empty interval D, write \(x:= s(D) = \min (D)\), and \(D':= D\setminus \{x\}\). If \(D'\) is non-empty, then condition (4.20) reads

$$\begin{aligned} \sum _{\ell =1}^\infty z_\ell \, \mathrm {e}^{\alpha \ell } \le \mathrm {e}^\alpha - 1. \end{aligned}$$
(4.21)

Indeed in that case for each \(\ell \) there is a single \(\ell \)-rod X that contains x but does not intersect \(D'\) (note that \(x+1\in D'\) because of the assumption that D is an interval and \(x=\min D\)). The rod is simply the \(\ell \)-rod with right-most endpoint x. Moreover \(V_L (D'\cup X) - V_L(D') = \ell \) and \(V_L(D'\cup \{x\}) - V_L(D') = 1\).

On the other hand, if \(D'\) is empty, then the number of \(\ell \)-rods that contain any given site \(x\in \mathbb {Z}\) is equal to \(\ell \) and the number of L-rods intersecting an \(\ell \)-rod is equal to \(L+\ell -1\), therefore condition (4.20) reads instead

$$\begin{aligned} \sum _{\ell =1}^ \infty \ell z_\ell \, \mathrm {e}^{\alpha ( \ell +L-1)} \le \mathrm {e}^{\alpha L} - 1. \end{aligned}$$
(4.22)
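The two counting facts used above (\(\ell \) rods of length \(\ell \) through a fixed site, and \(L+\ell -1\) L-rods meeting a fixed \(\ell \)-rod) can be confirmed by brute-force enumeration (a quick sketch with arbitrary small rod lengths):

```python
def rods(length, lo, hi):
    # all rods {m, ..., m + length - 1} with left endpoint m in [lo, hi]
    return [set(range(m, m + length)) for m in range(lo, hi + 1)]

ell, L, x = 3, 5, 0
# number of ell-rods containing the site x
assert sum(1 for R in rods(ell, -20, 20) if x in R) == ell
# number of L-rods intersecting a fixed ell-rod
target = set(range(0, ell))
assert sum(1 for R in rods(L, -20, 20) if R & target) == L + ell - 1
```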

The proof of Theorem 4.5 is complete once we check the existence of \(\alpha >0\) and \(L\in \mathbb {N}\) such that the inequalities (4.21) and (4.22) hold true. Set

$$\begin{aligned} h(u):= 1+ \sum _{\ell =1}^\infty z_\ell \, u^\ell \qquad (u\in \mathbb {R}_+). \end{aligned}$$

Conditions (4.21) and (4.22) are equivalent to

$$\begin{aligned} h(\mathrm {e}^\alpha ) \le \mathrm {e}^\alpha , \quad h'(\mathrm {e}^\alpha ) \le 1- \mathrm {e}^{-\alpha L}. \end{aligned}$$
(4.23)

Notice that h is convex and monotone increasing with \(h(0)=1\). The assumption (4.19) yields the existence of some \(u = \mathrm {e}^\alpha >0\) such that \(h(u)< u\). On the other hand, clearly \(h(0) =1>0\). Therefore the intermediate value theorem yields the existence of a point \({\tilde{u}} \in (0,u)\) such that \(h({\tilde{u}}) = {\tilde{u}}\). The point \({\tilde{u}}\) is necessarily larger than 1 because \(h(\tilde{u})\) is. Suppose by contradiction that \(h'({\tilde{u}}) \ge 1\). Then the convexity of h implies

$$\begin{aligned} h(u) \ge h({\tilde{u}}) + h'({\tilde{u}}) (u-{\tilde{u}}) \ge h({\tilde{u}}) + (u-{\tilde{u}}) = u, \end{aligned}$$

which contradicts the assumption \(h(u)<u\). Therefore \(h({\tilde{u}}) ={\tilde{u}}>1\) and \(h'({\tilde{u}})<1\). Replacing \(\alpha \) with \(\tilde{\alpha }:= \log {\tilde{u}}\) if needed, and picking \(L=L(\alpha )\) large enough, we find that  (4.23) is satisfied for some \(\alpha >0\). This concludes the proof. \(\square \)
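The fixed-point argument above can be replayed numerically (a sketch for the hypothetical geometric choice \(z_\ell = c\,q^{\ell -1}\), for which h has the closed form \(h(u) = 1 + cu/(1-qu)\) on \(u<1/q\)):

```python
c, q = 0.05, 0.5                             # hypothetical geometric activities
h  = lambda u: 1.0 + c * u / (1.0 - q * u)   # valid for u < 1/q
dh = lambda u: c / (1.0 - q * u) ** 2        # h'(u)

# bisection for the smallest fixed point h(u) = u; the bracket [1, 1.4]
# satisfies h(1) > 1 and h(1.4) < 1.4
lo, hi = 1.0, 1.4
for _ in range(100):
    mid = 0.5 * (lo + hi)
    if h(mid) > mid:
        lo = mid
    else:
        hi = mid
u_tilde = 0.5 * (lo + hi)

assert u_tilde > 1.0          # so alpha = log(u_tilde) > 0
assert dh(u_tilde) < 1.0      # slope strictly below 1 at the fixed point
assert abs(h(u_tilde) - u_tilde) < 1e-9
```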

Proof of Theorem 4.5(b)

Let \(a(D):=\log {\tilde{T}}(D;z) = \log T(D;-z)\). In view of Eq. (2.15) and the alternating sign property, we have

$$\begin{aligned} a(D) = \sum _{k=1}^\infty \frac{1}{k!}\sum _{(Y_1,\ldots , Y_k)\in \mathbb X^k} \mathbb {1}_{\{\exists i:\, Y_i \cap D\ne \varnothing \}} \bigl |\varphi _k^{\mathsf {T}}(Y_1,\ldots , Y_k) \bigr |\, z(Y_1) \cdots z(Y_k). \end{aligned}$$

By Proposition 3.12, for every \(D\subset \mathbb {Z}\setminus \{1\}\), we have

$$\begin{aligned} \sum _{\begin{array}{c} Y\ni 1, \\ Y\cap D= \varnothing \end{array}} z(Y) \mathrm {e}^{a(D\cup Y) - a(D)} \le \mathrm {e}^{a(D\cup \{1\}) - a(D)} - 1. \end{aligned}$$
(4.24)

Let us choose \(D\subset \mathbb {Z}\cap (-\infty ,0]\) with \(0 \in D\). Then for every given \(\ell \in \mathbb {N}\), the unique rod of length \(\ell \) that contains 1 but does not intersect D is the rod \(\{1,\ldots ,\ell \}\), and we obtain

$$\begin{aligned} \sum _{\ell =1}^\infty z_\ell \, \mathrm {e}^{a(D\cup \{1,\ldots , \ell \}) - a(D)} \le \mathrm {e}^{a(D\cup \{1\}) - a(D)} - 1. \end{aligned}$$
(4.25)

Let \(D_0:=D\) and for \(m\ge 1\) set \(D_{m}:= D\cup \{1,\ldots ,m\}\). The exponent on the left-hand side in (4.25) may be written as

$$\begin{aligned} a(D\cup \{1,\ldots , \ell \}) - a(D) = \sum _{m=1}^{\ell } \bigl ( a( D_{m}) - a(D_{m-1}) \bigr ). \end{aligned}$$
(4.26)

Now

$$\begin{aligned}&a(D_m) - a(D_{m-1})\\&\quad = \sum _{k=1}^\infty \frac{1}{k!}\sum _{(Y_1,\ldots , Y_k)\in \mathbb X^k}\Bigl ( \mathbb {1}_{\{\exists i:\, Y_i \cap D_m \ne \varnothing \}} - \mathbb {1}_{\{\exists i:\, Y_i \cap D_{m-1} \ne \varnothing \}} \Bigr ) \bigl | \varphi _k^{\mathsf {T}}(Y_1,\ldots , Y_k)\bigr | z(Y_1)\cdots z(Y_k). \end{aligned}$$

The only clusters \((Y_1,\ldots , Y_k)\) that contribute to the sum are those that intersect \(D_{m}\) but do not intersect \(D_{m-1}\). This is only possible if one of the \(Y_i\)’s contains m and all of them are contained in \(\mathbb {Z}\cap [m,\infty )\). Thus

$$\begin{aligned}&a(D_m) - a(D_{m-1})\\&\quad = \sum _{k=1}^\infty \frac{1}{k!}\sum _{(Y_1,\ldots , Y_k)\in \mathbb X^k} \mathbb {1}_{\{\exists i:\, Y_i \ni m\}} \mathbb {1}_{\{\forall i:\, Y_i \subset [m,\infty )\}} \bigl | \varphi _k^{\mathsf {T}}(Y_1,\ldots , Y_k)\bigr | z(Y_1)\cdots z(Y_k). \end{aligned}$$

Because of the translational invariance, the value of the sum does not depend on m. Thus \(a(D_m) - a(D_{m-1}) = \alpha \) for all \(m\ge 1\), for some fixed \(\alpha >0\). Turning back to (4.26), we obtain

$$\begin{aligned} a(D\cup \{1,\ldots , \ell \}) - a(D) = \ell \alpha \end{aligned}$$

and then (4.25) yields \(\sum _{\ell =1}^\infty z_\ell \exp (\alpha \ell ) \le \exp ( \alpha ) - 1\). \(\square \)

Tonks Gas on \(\mathbb {R}\)

Next, we want to consider the continuous version of the one-dimensional Tonks gas. Let \((L_\ell )_{\ell \in \mathbb {N}}\) be a sequence of strictly positive numbers and \(\mathbb X\) the space of compact intervals \(I\subset \mathbb {R}\) with lengths \(|I|\in \{L_\ell \mid \ell \in \mathbb {N}\}\). The map \(\mathbb {R}\times \mathbb {N}\rightarrow \mathbb X\), \((x,\ell )\mapsto [x-L_\ell /2,x+L_\ell /2]\), is a bijection. The reference measure \(\lambda \) is defined by the equality

$$\begin{aligned} \int _\mathbb X h(X) \lambda (\mathrm {d}X) = \sum _{\ell =1}^\infty \int _{-\infty }^\infty h\bigl (\bigl [ x - \tfrac{L_\ell }{2},x+ \tfrac{L_\ell }{2}\bigr ]\bigr ) \mathrm {d}x \end{aligned}$$

for all non-negative measurable functions \(h:\mathbb X\rightarrow \mathbb {R}_+\). We assume that the activity is of the form

$$\begin{aligned} z(X) = {\left\{ \begin{array}{ll} z_\ell , &{}\quad X = [x,x+L_\ell ]\, \text { for some }\ell \in \mathbb {N}, x\in \mathbb {R},\\ 0, &{}\quad \text {else} \end{array}\right. } \end{aligned}$$

for some sequence \((z_\ell )_{\ell \in \mathbb {N}}\) of non-negative numbers. We assume that rod lengths are bounded from below, i.e., there exists \(\delta >0\) such that

$$\begin{aligned} \inf _{\ell \in \mathbb {N}} L_\ell \ge \delta . \end{aligned}$$
(4.27)

From here on, we will consider the following chopping map: For \(X=[x,x+L_\ell ]\in \mathbb X\), let \(C(X) = \{E_1,\ldots , E_m\}\) consist of the intersections of X with the intervals \([x+(k-1)\varepsilon , x+k\varepsilon )\) with \(k\in \mathbb {Z}\), where \(\varepsilon \in (0,\delta )\). The space of snippets \(\mathbb E_\varepsilon \) consists of intervals \([a,b]\) and \([a,b)\) of length \(b-a\le \varepsilon \).
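A minimal sketch of this chopping map (the function below is ours, written for illustration; only the last snippet keeps a closed right endpoint):

```python
import math

def chop(x, L, eps):
    # snippets of X = [x, x+L]: non-empty intersections of X with the
    # half-open intervals [x + (k-1)*eps, x + k*eps)
    m = math.ceil(L / eps)          # number of snippets
    snips = []
    for k in range(1, m + 1):
        a = x + (k - 1) * eps
        b = min(x + k * eps, x + L)
        snips.append((a, b))
    return snips

pieces = chop(0.0, 1.0, 0.3)
assert len(pieces) == 4                                   # 3 full + 1 short
assert all(b - a <= 0.3 + 1e-12 for a, b in pieces)       # each at most eps
assert abs(sum(b - a for a, b in pieces) - 1.0) < 1e-12   # lengths add to L
```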

Theorem 4.7

In the setup of multi-type Tonks gas on \(\mathbb {R}\), under the assumption (4.27):

  (a)

    Suppose there exists \(\alpha >0\) such that

    $$\begin{aligned} \sum _{\ell =1}^\infty \mathrm {e}^{\alpha L_\ell }z_\ell <\alpha . \end{aligned}$$
    (4.28)

    Then the expansion for \(T(D;z)\) is absolutely convergent for all bounded sets \(D\subset \mathbb {R}\).

  (b)

    Conversely, if \(T(D;z)\) is absolutely convergent for all bounded subsets \(D\subset \mathbb {R}\), then there exists \(\alpha >0\) such that (4.28) holds true with “\(\le \)” instead of “<”.

Remark 4.5

The theorem essentially recovers the necessary and sufficient convergence criterion from [13] (derived there for the activity expansion of the pressure in the system). The sufficient condition in [13] is (4.28) with “\(\le \)” instead of “<”. Again, while the result itself is not novel, its proof demonstrates the potential of our approach to go beyond the Fernández–Procacci criterion—also in continuous setups.

First we prove an auxiliary result, the analogue of Proposition 4.6 for the continuous setup, whose proof is not quite as straightforward. We introduce the following notion: Define the \(\varepsilon \)-gap-filling operation \(\ \widehat{\cdot }\ \) by setting \(\widehat{D}:=D\cup \{x\in \mathbb {R}\vert \ \exists y, z\in D \text { with } y<x<z \text { such that } z-y\le \varepsilon \}\) for any \(D\subset \mathbb {R}\). For \({\mathscr {P}}\) some subset of the power set of \(\mathbb {R}\), we say that a function \(\xi :{\mathscr {P}}\rightarrow \mathbb {R}\) does not see gaps of diameter at most \(\varepsilon \) if it is invariant under the \(\varepsilon \)-gap-filling operation, i.e., if \(\xi (D)=\xi (\widehat{D})\) for all \(D\in {\mathscr {P}}\).
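On finite unions of intervals, the \(\varepsilon \)-gap-filling operation amounts to merging intervals separated by gaps of length at most \(\varepsilon \) (a sketch; the endpoints below are arbitrary and the input list is assumed non-empty):

```python
def fill_gaps(intervals, eps):
    # merge sorted intervals (a, b) whose separating gap is at most eps;
    # this realizes the eps-gap-filling operation on unions of intervals
    ivs = sorted(intervals)
    merged = [list(ivs[0])]
    for a, b in ivs[1:]:
        if a - merged[-1][1] <= eps:        # gap of diameter <= eps: fill it
            merged[-1][1] = max(merged[-1][1], b)
        else:
            merged.append([a, b])
    return [tuple(iv) for iv in merged]

# a gap of length 0.05 <= eps is filled; a gap of length 1 is not
assert fill_gaps([(0, 1), (1.05, 2), (3, 4)], eps=0.1) == [(0, 2), (3, 4)]
```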

Proposition 4.8

Suppose that there exists a non-negative, measurable map \(a(\cdot )\) defined on finite unions of (bounded) intervals which does not see gaps of diameter at most \(\varepsilon \) and satisfies the following system of inequalities: For any (bounded) interval D with \(C(D)=\{E_1,\ldots , E_n\}\), \(E_1,\ldots , E_n\in \mathbb {E}_\varepsilon \), where the chopping map C is defined as above, there is a subinterval \(E_s\subset D\) of length at most \(\varepsilon \), such that

$$\begin{aligned} \sum _{k=1}^\infty \frac{1}{k!}\int _{\mathbb X^k} I(E_s; D'; Y_1,\ldots , Y_k) \mathrm {e}^{a(D'\cup Y_1\cup \cdots \cup Y_k) - a(D')} \lambda _z^k(\mathrm {d}\varvec{Y}) \le \mathrm {e}^{a(E_s\cup D') - a(D')}-1, \end{aligned}$$
(4.29)

where we set \(D':=D\backslash E_s\) and \(I(E_s;D'; Y_1,\ldots , Y_k)\) is the indicator from Eq. (2.16). Then \(T(D;z)\) is absolutely convergent for all bounded subsets \(D\subset \mathbb R\).

Proof

We can modify the Kirkwood–Salsburg-type equations \(\tilde{\kappa }^s_z\) from Chapter 3.3 as follows: If \(\xi (\cdot )\) is a function from \(\mathbb D_\varepsilon \) to \(\mathbb {R}_+\) that does not see gaps of diameter at most \(\varepsilon \) and satisfies the measurability assumption from Theorem 2.6, define the function \(\tilde{\mathscr {K}}^s_z\xi \) (possibly assuming the value “\(\infty \)”) by

for \(D\in \mathbb D_\varepsilon \) with \(E_1,\ldots ,E_n\in \mathbb E_\varepsilon \) and \(C(D)=\{E_1,\ldots , E_n\}\), where \(\widehat{D}\) is given by “filling gaps” of diameter at most \(\varepsilon \) in \(D\subset \mathbb {R}\) as defined above.

Notice that for any such function \(\xi (\cdot )\) (that does not see gaps of diameter at most \(\varepsilon \) and satisfies the measurability assumption from Theorem 2.6)

$$\begin{aligned} \tilde{\mathscr {K}}^s_z\xi ={\tilde{\kappa }}^s_z\xi \end{aligned}$$

holds, where \({\tilde{\kappa }}^s_z\xi \) is the function defined by (3.9). In particular, the left hand side of the equation is well-defined. Since the functions \(\tilde{T}_N(\cdot ;z)\), \(N\in \mathbb {N}\), and \({\tilde{T}}(\cdot ;z)\) do not see gaps of diameter at most \(\varepsilon \) (by our assumption \(\varepsilon <\delta \) and the respective definitions), Proposition 3.10 implies

$$\begin{aligned} {\tilde{T}}\bigl (\cdot ;z\bigr ) = e(\cdot ) +\tilde{{\mathscr {K}}}_z^s {\tilde{T}}(\cdot ;z) \end{aligned}$$

and

$$\begin{aligned} {\tilde{T}}_{N+1}\bigl (\cdot ;z\bigr ) = e(\cdot ) + (\tilde{\mathscr {K}}_z^s {\tilde{T}}_N)\bigl (\cdot ;z\bigr ). \end{aligned}$$

Assumption (4.29) is equivalent to \(e(D)+(\tilde{{\mathscr {K}}}^s_z\mathrm {e}^a)(D)\le \mathrm {e}^{a(D)}\) for any interval \(D\subset \mathbb {R}\). We prove by induction over N that \({\tilde{T}}_N(D;z) \le \mathrm {e}^{a(D)}\) for all \(N\in \mathbb {N}\) and all intervals \(D\subset \mathbb {R}\). For \(N=1\), we have by our assumption

$$\begin{aligned} {\tilde{T}}_1(D;z) = e(D)\le e(D) + (\tilde{{\mathscr {K}}}_z^s \mathrm {e}^a)(D)\le \mathrm {e}^{a(D)} \end{aligned}$$

for all intervals \(D\subset \mathbb {R}\). Next, assume for some \(N\in \mathbb {N}\) that \({\tilde{T}}_N(D;z) \le \mathrm {e}^{a(D)}\) for all intervals \(D\subset \mathbb {R}\), then

$$\begin{aligned} {{\tilde{T}}}_{N+1}(D;z) = e(D)+(\tilde{{\mathscr {K}}}_z^s \tilde{T}_{N})(D;z) \le e(D) +(\tilde{{\mathscr {K}}}_z^s \mathrm {e}^a)(D)\le \mathrm {e}^{a(D)}, \end{aligned}$$

where the first inequality holds by the inductive hypothesis, by monotonicity of \(\tilde{{\mathscr {K}}}_z^s\) on non-negative functions and by the observation that for intervals \(D\subset \mathbb {R}\) all the arguments of \(\xi \) appearing in the definition of \((\tilde{\mathscr {K}}_z^s\xi )(D)\) are again intervals.

This completes the induction and proves \({\tilde{T}}_N(D;z)\le \mathrm {e}^{a(D)}\) for all \(N\in \mathbb {N}\) and all intervals \(D\subset \mathbb {R}\). Taking the limit \(N\rightarrow \infty \) yields the corresponding bound for \(\tilde{T}(D;z)\). The claim for arbitrary bounded subsets follows since every bounded subset is contained in some compact interval and

$$\begin{aligned} \tilde{T}(D_1;z)\le {\tilde{T}}(D_2;z) \end{aligned}$$

for \(D_1\subset D_2\subset \mathbb {R}\). \(\square \)
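The inductive mechanism of the proof above can be illustrated by a toy scalar analogue (a sketch only, not part of the argument): if the iteration \(T_{N+1} = e + K(T_N)\) uses a monotone map \(K\) and some \(M\) satisfies \(e + K(M)\le M\), then every iterate stays below \(M\).

```python
# Toy scalar analogue of the induction (illustrative only): iterate
# T_{N+1} = e + K(T_N) with the monotone map K(t) = c * t.  If
# e + K(M) <= M for some M, then every iterate satisfies T_N <= M.
e, c, M = 1.0, 0.5, 2.0            # here e + c * M = 2.0 <= M
T = e                              # T_1 = e
for _ in range(100):
    assert T <= M                  # the inductive bound
    T = e + c * T                  # next iterate
assert abs(T - 2.0) < 1e-9         # the iterates converge to the fixed point
```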

Proof of Theorem 4.7(a)

In analogy to the discrete case, for \(\alpha >0\) and \(L >0\) let

$$\begin{aligned} V_L(D):= \int _{-\infty }^\infty \mathbb {1}_{\{ [x,x+L]\cap D\ne \varnothing \}}\, \mathrm {d}x \end{aligned}$$

and \(a(D)\equiv a_{\alpha ,L}(D):= \alpha \, V_L(D)\). The choice of \(\alpha \) and L is specified later in the proof. We apply Proposition 4.8 with the choice of the chopping map introduced at the beginning of this subsection and the selection rule s that picks the leftmost snippet.

Recall the indicator \(I(E_1;D'; Y_1,\ldots , Y_k)\) from Eq. (2.16). We show that there exists \(\alpha >0\) such that

$$\begin{aligned}&\sum _{k=1}^\infty \frac{1}{k!}\int _{\mathbb X^k} I(E_1;D';Y_1,\ldots ,Y_k) \mathrm {e}^{\alpha [V_L(D'\cup Y_1\cup \cdots \cup Y_k) - V_L(D')]} \lambda _z^k(\mathrm {d}\varvec{Y}) \nonumber \\&\quad \quad \quad \le \mathrm {e}^{\alpha [V_L(D'\cup E_1) - V_L(D')]} - 1 \end{aligned}$$
(4.30)

for all intervals \(D'=[a,b)\subset \mathbb {R}\) or \(D' = [a,b]\), including the empty set \(D'= \varnothing \), and all snippets \(E_1 = [(k-1)\varepsilon , a)\in \mathbb E_\varepsilon \).

If \(D'\) is non-empty, then because of \(\inf _{\ell \in \mathbb {N}} L_\ell \ge \varepsilon \) and \(|E_1|\le \varepsilon \) there cannot be two or more disjoint rods in \(\mathbb X\) that intersect \(E_1\) but do not intersect \(D'\), so the inequality to be proven reduces to

$$\begin{aligned} \int _{\mathbb {X}} \mathrm {e}^{\alpha (V_L(D'\cup Y)-V_L(D'))} \mathbb {1}_{\{Y\cap E_1\ne \varnothing ,\ Y\cap D'= \varnothing \}}\lambda _z(\mathrm {d}Y)\le \mathrm {e}^{\alpha (V_L(D'\cup E_1)-V_L(D'))} - 1. \end{aligned}$$
(4.31)

Assuming that \(L \ge \varepsilon \), this is equivalent to

$$\begin{aligned} \sum _{\ell =1}^\infty z_\ell \int _0^{|E_1|}\, \mathrm {e}^{\alpha (L_\ell +x)} \mathrm {d}x \le \mathrm {e}^{\alpha |E_1|} - 1. \end{aligned}$$
(4.32)

The integral on the left-hand side is equal to \(\exp ( \alpha L_\ell ) [\exp (\alpha |E_1|)- 1]/\alpha \), so we find that (4.32) is equivalent to

$$\begin{aligned} \sum _{\ell =1}^\infty z_\ell \, \mathrm {e}^{\alpha L_\ell } \le \alpha , \end{aligned}$$

which holds true because of the assumption (4.28).
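The closed form of the integral used in this step is easily verified numerically; the following Python sketch (with arbitrary sample values for \(\alpha \), \(L_\ell \) and \(|E_1|\)) compares it against a midpoint Riemann sum.

```python
import math

# Sanity check (illustrative sample values) of the closed form quoted above:
# int_0^{|E_1|} exp(alpha*(L_l + x)) dx = exp(alpha*L_l)*(exp(alpha*|E_1|)-1)/alpha
alpha, L_l, e1 = 0.7, 1.5, 0.25
closed = math.exp(alpha * L_l) * (math.exp(alpha * e1) - 1) / alpha

n = 200_000                        # midpoint-rule approximation
h = e1 / n
riemann = h * sum(math.exp(alpha * (L_l + (i + 0.5) * h)) for i in range(n))
assert abs(closed - riemann) < 1e-6
```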

If \(D'\) is empty, we note that there can be at most two disjoint rods in \(\mathbb X\) that intersect the snippet \(E_1\), hence (4.30) becomes

$$\begin{aligned}&\int _\mathbb X \mathrm {e}^{\alpha V_L(Y)} \mathbb {1}_{\{Y\cap E_1\ne \varnothing \}} \lambda _z(\mathrm {d}Y) + \frac{1}{2} \int _{\mathbb X^2} \mathrm {e}^{\alpha V_L(Y_1\cup Y_2)} \mathbb {1}_{\{Y_1\cap E_1\ne \varnothing ,\, Y_2\cap E_1\ne \varnothing ,\, Y_1\cap Y_2 = \varnothing \}} \lambda _z^2\bigl (\mathrm {d}(Y_1,Y_2)\bigr ) \nonumber \\&\quad \le \mathrm {e}^{\alpha V_L(E_1)} -1. \end{aligned}$$
(4.33)

The right-hand side is equal to \(\exp (\alpha (L + |E_1|)) -1\). The first term on the left-hand side is equal to

$$\begin{aligned} \sum _{\ell =1}^\infty z_\ell (L_\ell + |E_1|) \mathrm {e}^{\alpha (L + L_\ell )} = \sum _{\ell =1}^\infty z_\ell \, L_\ell \, \mathrm {e}^{\alpha (L + L_\ell )} + O(\varepsilon ). \end{aligned}$$

The second term on the left-hand side of (4.33) is equal to

$$\begin{aligned} \sum _{\ell ,r=1}^\infty z_{\ell } z_r \int _{E_1^2} \mathbb {1}_{\{x<y\}}\, \mathrm {e}^{\alpha V_L( [x - L_\ell , y+L_r])} \mathrm {d}x \mathrm {d}y \end{aligned}$$

which is bounded by

$$\begin{aligned} \Bigl (\sum _{\ell =1}^\infty z_\ell \mathrm {e}^{\alpha L_\ell } \Bigr )^2 \mathrm {e}^{\alpha (\varepsilon +L)}|E_1|^2 = O(\varepsilon ^2). \end{aligned}$$

For the inequality (4.33) to be satisfied, it is sufficient that

$$\begin{aligned} \sum _{\ell =1}^\infty L_\ell z_\ell \mathrm {e}^{\alpha L_\ell } + O(\varepsilon ) \le \mathrm {e}^{\alpha |E_1|} - \mathrm {e}^{-\alpha L}. \end{aligned}$$
(4.34)

Arguments similar to those in the proof of Theorem 4.5(b), applied to the convex function \(h:\mathbb {R}_+\rightarrow \mathbb {R}\), \(h(u): = 1+\sum _{\ell =1}^\infty z_\ell u^{L_\ell }\), show that under assumption (4.28) one can choose \(\alpha >0\) so that not only (4.28) holds true but in addition

$$\begin{aligned} h'(\mathrm {e}^\alpha ) = \sum _{\ell =1}^\infty L_\ell z_\ell \mathrm {e}^{\alpha L_\ell } < 1. \end{aligned}$$

Thus one can choose \(L = L(\alpha )\) large enough and \(\varepsilon \) small enough so that (4.34) and hence (4.33) hold true. \(\square \)
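The simultaneous choice of \(\alpha \) can be illustrated numerically; the Python sketch below uses two illustrative rod species (lengths and activities are arbitrary sample values) and checks that a single \(\alpha \) satisfies both condition (4.28) and the derivative condition \(h'(\mathrm {e}^\alpha )<1\).

```python
import math

# Illustrative check that, for small enough activities, one alpha satisfies
# both condition (4.28) and the derivative condition h'(e^alpha) < 1.
L = [1.0, 2.0]                     # sample rod lengths
z = [0.10, 0.02]                   # sample activities

alpha = 0.8
S = sum(zl * math.exp(alpha * Ll) for zl, Ll in zip(z, L))         # (4.28) lhs
dh = sum(Ll * zl * math.exp(alpha * Ll) for zl, Ll in zip(z, L))   # h'(e^alpha)
assert S < alpha
assert dh < 1.0
```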

Proof of Theorem 4.7(b)

We proceed as in the proof of Theorem 4.5(b). Suppose that the expansions are absolutely convergent and define

$$\begin{aligned} a(D):= \log T(D;-z) = \sum _{k=1}^\infty \frac{1}{k!}\int _{\mathbb X^k} \mathbb {1}_{\{ \exists i\in [k]:\, Y_i \cap D \ne \varnothing \}} \bigl |\varphi _{k}^{\mathsf {T}}(Y_1,\ldots , Y_k)\bigr |\, \lambda _z^k(\mathrm {d}\varvec{Y}). \end{aligned}$$

Then by Proposition 3.13 and since \(\inf _{\ell \in \mathbb {N}} L_\ell \) is bounded from below by \( \varepsilon >0\),

$$\begin{aligned} \int _{\mathbb X} \mathbb {1}_{\{ X\cap E_1 \ne \varnothing ,\, X\cap D'= \varnothing \}} \, \mathrm {e}^{a(D'\cup X) - a(D')} \lambda _z(\mathrm {d}X) \le \mathrm {e}^{a(D'\cup E_1) - a(D')} - 1 \end{aligned}$$
(4.35)

for example for \(E_1= [0,\varepsilon )\) and \(D' = [\varepsilon ,\varepsilon +L]\) with \(L>0\) and \(\varepsilon \) sufficiently small.

Before we evaluate the two sides of the inequality, we note two useful properties of \(a(\cdot )\). First, the map a does not see gaps of diameter at most \(\varepsilon \). Precisely, if \(X=[x- L_\ell , x]\) with \(x\in [0,\varepsilon )\) and \(D'\) is as above, then

$$\begin{aligned} a(D'\cup X) = a([x-L_\ell ,\varepsilon +L]). \end{aligned}$$

Indeed, any rod \(Y_i\in \mathbb X\) that intersects \([0,\varepsilon )\) must also intersect \(D'\cup X\) because its length satisfies \(|Y_i| \ge \varepsilon \). Second, because of translational invariance, the weight a(D) of a non-empty interval depends only on its length |D|. We check that, in addition, it is an affine function of the length. For \(x\in \mathbb {R}\), define

$$\begin{aligned} \alpha (x):= \sum _{\ell =1}^\infty z_\ell \sum _{k=0}^\infty \frac{1}{k!} \int _{\mathbb X^k} \mathbb {1}_{\{\forall i \in [k]:\, Y_i \subset (-\infty , x]\}} \bigl |\varphi _{1+k}^{\mathsf {T}}(Y_1,\ldots , Y_k, [x-L_\ell ,x])\bigr |\, \lambda _z^k(\mathrm {d}\varvec{Y}). \end{aligned}$$

The quantity \(\alpha (x)\) is best thought of as an integral over clusters in which the right-most rod \([x-L_\ell , x]\) has its right end pinned at x. By translational invariance, \(\alpha (x)\) is actually independent of x and we may write \(\alpha (x) \equiv \alpha \) for some scalar \(\alpha \ge 0\). Now let \(I= [a,b]\) and \(J= [b,c]\) with \(a<b<c\). Then

$$\begin{aligned} a(I\cup J) - a(J) = \sum _{k=1}^\infty \frac{1}{k!}\int _{\mathbb X^k} \mathbb {1}_{\{ \exists i\in [k]:\, Y_i \cap I \ne \varnothing \}}\mathbb {1}_{\{\forall i\in [k]:\, Y_i \cap J = \varnothing \}} \bigl |\varphi _{k}^{\mathsf {T}}(Y_1,\ldots , Y_k)\bigr |\, \lambda _z^k(\mathrm {d}\varvec{Y}). \end{aligned}$$

Any cluster \((Y_1,\ldots , Y_k)\) that intersects I but not J has its right-most end in \([a,b)\), therefore

$$\begin{aligned} a(I\cup J) - a(J) = \int _I \alpha (x) \mathrm {d}x = \alpha \, |I|. \end{aligned}$$

With these two observations, the left-hand side of (4.35) becomes

$$\begin{aligned} \sum _{\ell =1}^\infty z_\ell \int _0^\varepsilon \mathrm {e}^{ a( [x- L_\ell ,x]\cup D') - a(D')} \mathrm {d}x =\sum _{\ell =1}^\infty z_\ell \int _0^\varepsilon \mathrm {e}^{\alpha (x+ L_\ell )} \mathrm {d}x = \sum _{\ell =1}^\infty z_\ell \mathrm {e}^{\alpha L_\ell } \frac{1}{\alpha }(\mathrm {e}^{\alpha \varepsilon } - 1) \end{aligned}$$

while the right-hand side of (4.35) is \(\exp (\alpha \varepsilon ) - 1\). It follows that

$$\begin{aligned} \sum _{\ell =1}^\infty z_\ell \mathrm {e}^{\alpha L_\ell } \le \alpha . \end{aligned}$$

\(\square \)