1 Intermediate dimensions: definitions and background

We work with subsets of \({\mathbb {R}}^n\) throughout, although much of what we establish also holds in more general metric spaces. We denote the diameter of a set F by |F|, and when we refer to a cover \(\{U_i\}\) of a set F we mean that \(F\subseteq \bigcup _i U_i\) where \(\{U_i\}\) is a finite or countable collection of sets.

Recall that Hausdorff dimension \(\dim _\mathrm{H}\) may be defined without introducing Hausdorff measures, but using Hausdorff content. For \(F\subseteq {\mathbb {R}}^n\),

$$\begin{aligned} \dim _\mathrm{H}F = \inf \big \{&s\ge 0 : \hbox { for all }\varepsilon >0 \hbox { there exists a cover } \{U_i\}\hbox { of }F\hbox { such that }\sum |U_i|^s \le \varepsilon \big \}, \end{aligned}$$

see [2, Section 3.2]. (Lower) box dimension \(\underline{\mathrm{dim}}_\mathrm{B}\,\) may be expressed in a similar manner, by forcing the covering sets to be of the same diameter. For bounded \(F\subseteq {\mathbb {R}}^n\),

$$\begin{aligned} \underline{\mathrm{dim}}_\mathrm{B}\,F = \inf \Bigg \{ s\ge 0&: \hbox { for all }\varepsilon >0 \hbox { there exists a cover } \{U_i\} \hbox { of }F \\&\hbox { such that }|U_i| = |U_j|\hbox { for all }i,j\hbox { and }\sum |U_i|^s \le \varepsilon \Bigg \}. \end{aligned}$$

see [2, Chapter 2]. Expressed in this way, Hausdorff and box dimensions may be regarded as extreme cases of the same definition, one with no restriction on the size of covering sets, and the other requiring them all to have equal diameters. With this in mind, one might regard them as the extremes of a continuum of dimensions with increasing restrictions on the relative sizes of covering sets. This is the main idea of this paper, which we formalise by considering restricted coverings where the diameters of the smallest and largest covering sets lie in a geometric range \(\delta ^{1/\theta } \le |U_i| \le \delta \) for some \(0\le \theta \le 1\).

Definition 1.1

Let \(F\subseteq {\mathbb {R}}^n\) be bounded. For \(0\le \theta \le 1\) we define the lower \(\theta \)-intermediate dimension of F by

$$\begin{aligned} \underline{\mathrm{dim}}_{\,\theta }F = \inf \Bigg \{&s\ge 0 : \hbox { for all }\varepsilon>0 \hbox { and all }\delta _0>0,\hbox { there exists }0<\delta \le \delta _0 \\&\hbox { and a cover } \{U_i\} \hbox { of }F\hbox { such that } \delta ^{1/\theta } \le |U_i| \le \delta \hbox { and } \sum |U_i|^s \le \varepsilon \Bigg \}. \end{aligned}$$

Similarly, we define the upper \(\theta \)-intermediate dimension of F by

$$\begin{aligned} \overline{\mathrm{dim}}_{\,\theta }F = \inf \Bigg \{&s\ge 0 : \hbox { for all }\varepsilon>0 \hbox { there exists }\delta _0>0\hbox { such that for all }0<\delta \le \delta _0, \\&\hbox { there is a cover } \{U_i\} \hbox { of }F\hbox { such that } \delta ^{1/\theta } \le |U_i| \le \delta \hbox { and } \sum |U_i|^s \le \varepsilon \Bigg \}. \end{aligned}$$

With these definitions,

$$\begin{aligned} \underline{\hbox { dim}}_{0} F = \overline{\hbox { dim}}_{0} F = \mathrm{dim}_\mathrm{H}\,F, \quad \underline{\hbox { dim}}_{1} F = \underline{\mathrm{dim}}_\mathrm{B}\,F \quad {\hbox { and }}\quad \overline{\hbox { dim}}_{1} F = \overline{\mathrm{dim}}_\mathrm{B}\,F, \end{aligned}$$

where \(\overline{\mathrm{dim}}_\mathrm{B}\,\) is the upper box dimension. Moreover, it follows immediately that, for a bounded set F and \(\theta \in [0,1]\),

$$\begin{aligned} \mathrm{dim}_\mathrm{H}\,F \le \underline{\mathrm{dim}}_{\,\theta }F \le \overline{\mathrm{dim}}_{\,\theta }F \le \overline{\mathrm{dim}}_\mathrm{B}\,F \quad {\hbox { and }}\quad \underline{\mathrm{dim}}_{\,\theta }F \le \underline{\mathrm{dim}}_\mathrm{B}\,F. \end{aligned}$$

It is also immediate that \(\underline{\mathrm{dim}}_{\,\theta }F\) and \(\overline{\mathrm{dim}}_{\,\theta }F\) are increasing in \(\theta \), though as we shall see they need not be strictly increasing. Furthermore, \(\overline{\mathrm{dim}}_{\,\theta }\) is finitely stable, that is \(\overline{\mathrm{dim}}_{\,\theta }(F_1\cup F_2) = \max \{\overline{\mathrm{dim}}_{\,\theta }F_1, \overline{\mathrm{dim}}_{\,\theta }F_2\}\), and, for \(\theta \in (0,1]\), both \(\underline{\mathrm{dim}}_{\,\theta }F\) and \(\overline{\mathrm{dim}}_{\,\theta }F\) are unchanged on replacing F by its closure.

In many situations, even if \(\mathrm{dim}_\mathrm{H}\,F < \overline{\mathrm{dim}}_\mathrm{B}\,F\), we still have \(\underline{\mathrm{dim}}_\mathrm{B}\,F = \overline{\mathrm{dim}}_\mathrm{B}\,F\) and \(\underline{\mathrm{dim}}_{\,\theta }F = \overline{\mathrm{dim}}_{\,\theta }F\) for all \(\theta \in [0,1]\). In this case we refer to the box dimension \(\mathrm{dim}_\mathrm{B}F = \underline{\mathrm{dim}}_\mathrm{B}\,F = \overline{\mathrm{dim}}_\mathrm{B}\,F\) and the \(\theta \)-intermediate dimension \(\hbox { dim}_{\theta } F = \underline{\mathrm{dim}}_{\,\theta }F = \overline{\mathrm{dim}}_{\,\theta }F\).

This paper is devoted to understanding \(\theta \)-intermediate dimensions. The hope is that \(\hbox { dim}_{\theta } F\) will interpolate between the Hausdorff and box dimensions in a meaningful way, a rich and robust theory will be discovered, and interesting further questions unearthed. We first derive useful properties of intermediate dimensions, including that \(\underline{\hbox { dim}}_{0} F\) and \(\overline{\hbox { dim}}_{0} F \) are continuous on (0, 1] but not necessarily at 0, as well as proving versions of the mass distribution principle, Frostman’s lemma and product formulae. We then examine a range of examples illustrating different types of behaviour including sequences formed by negative powers of integers, and self-affine Bedford–McMullen carpets.

Intermediate dimensions provide an insight into the distribution of the diameters of covering sets needed when estimating the Hausdorff dimensions of sets whose Hausdorff and box dimensions differ. They also have concrete applications to well-studied problems. For example, since the intermediate dimensions are preserved under bi-Lipschitz mappings, they provide another invariant for Lipschitz classification of sets. A very specific variant was used in [6] to estimate the singular sets of partial differential equations.

A related approach to ‘dimension interpolation’ was recently considered in [5] where a new dimension function was introduced to interpolate between the box dimension and the Assouad dimension. In this case the dimension function was called the Assouad spectrum, denoted by \(\hbox { dim}_\text {A}^{\theta } F\)\((\theta \in (0,1))\).

2 Properties of intermediate dimensions

2.1 Continuity

The first natural question is whether, for a fixed bounded set F, \(\underline{\mathrm{dim}}_{\,\theta }F\) and \(\overline{\mathrm{dim}}_{\,\theta }F\) vary continuously for \(\theta \in [0,1]\). We show this is the case, except possibly at \(\theta = 0\). We provide simple examples exhibiting discontinuity at \(\theta = 0\) in Sect. 3.2. However, for many natural sets F we find that the intermediate dimensions are continuous at 0 (and thus on [0, 1]), for example for self-affine carpets, see Sect. 4.

Proposition 2.1

Let F be a non-empty bounded subset of \({\mathbb {R}}^n\) and let \(0\le \theta <\phi \le 1\). Then

$$\begin{aligned} \underline{\mathrm{dim}}_{\,\theta }F \ \le \ \underline{\mathrm{dim}}_{\,\phi }F \ \le \ \underline{\mathrm{dim}}_{\,\theta }F +\left( 1-\frac{\theta }{\phi }\right) (n- \underline{\mathrm{dim}}_{\,\theta }F). \end{aligned}$$
(2.1)

and

$$\begin{aligned} \overline{\mathrm{dim}}_{\,\theta }F \ \le \ \overline{\mathrm{dim}}_{\,\phi }F \ \le \ \overline{\mathrm{dim}}_{\,\theta }F +\left( 1-\frac{\theta }{\phi }\right) (n- \overline{\mathrm{dim}}_{\,\theta }F). \end{aligned}$$
(2.2)

In particular, \(\theta \mapsto \underline{\mathrm{dim}}_{\,\theta }F\) and \(\theta \mapsto \overline{\mathrm{dim}}_{\,\theta }F\) are continuous for \(\theta \in (0,1]\).

Proof

We will only prove (2.1) since (2.2) is similar. The left-hand inequality of (2.1) is just the monotonicity of \(\underline{\mathrm{dim}}_{\,\theta }F\). The right-hand inequality is trivially satisfied when \(\underline{\mathrm{dim}}_{\,\theta }F=n\), so we assume that \(0\le \underline{\mathrm{dim}}_{\,\theta }F <n\). Suppose that \(0\le \theta <\phi \le 1\) and that \(0\le \underline{\mathrm{dim}}_{\,\theta }F<s <n\). Then, given \(\varepsilon >0\), we may find arbitrarily small \(\delta >0\) and countable or finite covers \(\{U_i\}_{i\in I}\) of F such that

$$\begin{aligned} \sum _{i\in I} |U_i|^s<\varepsilon \quad \hbox { and } \quad \delta \le |U_i| \le \delta ^\theta \quad \hbox { for all } i\in I. \end{aligned}$$
(2.3)

Let

$$\begin{aligned} I_0 =\{i\in I: \delta \le |U_i| \le \delta ^\phi \} \quad \hbox { and } \quad I_1 =\{i\in I: \delta ^\phi < |U_i| \le \delta ^\theta \}. \end{aligned}$$

For each \( i\in I_1\) we may split \(U_i\) into subsets of small coordinate cubes to get sets \(\{U_{i,j}\}_{j\in J_i}\) such that \(U_i \subseteq \bigcup _{j\in J_i} U_{i,j}\), with \(|U_{i,j}| \le \delta ^\phi \) and \(\mathrm{card} J_i \ \le \ c_n|U_i|^n \delta ^{-\phi n} \ \le \ c_n\delta ^{n(\theta -\phi )}\), where \(c_n = 4^n n^{n/2}\).

Let \(s<t\le n\). Then \( \{U_i\}_{i\in I_0} \cup \{U_{i,j}\}_{i\in I_1, j \in J_i}\) is a cover of F such that \(\delta \le |U_i|, |U_{i,j}| \le \delta ^\phi \). Taking sums with respect to this cover:

$$\begin{aligned} \sum _{i\in I_0}|U_i|^t + \sum _{i\in I_1}\sum _{j\in J_{i}}|U_{i,j}|^t\le & {} \sum _{i\in I_0}|U_i|^t + \sum _{i\in I_1} \delta ^{\phi t} c_n |U_i|^n \delta ^{-\phi n} \\\le & {} \sum _{i\in I_0}|U_i|^t + c_n \sum _{i\in I_1} |U_i|^s |U_i|^{n-s}\delta ^{\phi ( t-n)} \\\le & {} \sum _{i\in I_0}|U_i|^s + c_n \sum _{i\in I_1} |U_i|^s \delta ^{\theta (n-s)}\delta ^{\phi ( t-n)} \\\le & {} \sum _{i\in I_0}|U_i|^s + c_n \delta ^{\phi [t-(n\phi +\theta (s-n))/\phi ]}\sum _{i\in I_1} |U_i|^s \\\le & {} (1+c_n) \sum _{i\in I}|U_i|^s\ < \ (1+c_n)\varepsilon \end{aligned}$$

if \(t\ge (n\phi +\theta (s-n))/\phi \), from (2.3). This holds for some cover for arbitrarily small \(\varepsilon \) and all \(s>\underline{\mathrm{dim}}_{\,\theta }F\), giving \(\underline{\mathrm{dim}}_{\,\phi }F \le n + \theta (\underline{\mathrm{dim}}_{\,\theta }F-n)/\phi \), which rearranges to give (2.1).

Finally, note that (2.2) follows by exactly the same argument noting that the assumption \( \overline{\mathrm{dim}}_{\,\theta }F <s \) gives rise to \(\delta _0>0\) such that for all \(\delta \in (0,\delta _0)\) we can find covers \(\{U_i\}_{i\in I}\) of F satisfying (2.3). \(\square \)

2.2 A mass distribution principle for \(\underline{\mathrm{dim}}_{\,\theta }\) and \(\overline{\mathrm{dim}}_{\,\theta }\)

The mass distribution principle is a powerful tool in fractal geometry and provides a useful mechanism for estimating the Hausdorff dimension from below by considering measures supported on the set, see [2, page 67]. We present natural analogues for \(\underline{\mathrm{dim}}_{\,\theta }\) and \(\overline{\mathrm{dim}}_{\,\theta }\).

Proposition 2.2

Let F be a Borel subset of \({\mathbb {R}}^n\) and let \(0\le \theta \le 1\) and \(s\ge 0\). Suppose that there are numbers \(a, c, \delta _0 >0\) such that for all \(0< \delta \le \delta _0\) we can find a Borel measure \(\mu _\delta \) supported by F with \(\mu _\delta (F) \ge a \), and with

$$\begin{aligned} \mu _\delta (U) \le c|U|^s \quad \hbox { for all Borel sets }U \subseteq {\mathbb {R}}^n\hbox { with } \delta \le |U|\le \delta ^\theta . \end{aligned}$$
(2.4)

Then \(\underline{\mathrm{dim}}_{\,\theta }F \ge s\). Moreover, if measures \(\mu _\delta \) with the above properties can be found only for a sequence of \(\delta \rightarrow 0\), then the conclusion is weakened to \(\overline{\mathrm{dim}}_{\,\theta }F \ge s\).

Proof

Let \(\{U_i\}\) be a cover of F such that \(\delta \le |U_i|\le \delta ^\theta \) for all i. Then

$$\begin{aligned} a\ \le \ \mu _\delta (F) \ \le \ \mu _\delta \left( \bigcup _i U_i\right) \ \le \ \sum _i \mu _\delta (U_i) \ \le \ c\sum _i |U_i|^s, \end{aligned}$$

so that \(\sum _i |U_i|^s \ge a/c>0\) for every admissible cover and therefore \(\underline{\mathrm{dim}}_{\,\theta }F \ge s\).

The weaker conclusion regarding the upper intermediate dimension is obtained similarly. \(\square \)

Note the main difference between Proposition 2.2 and the usual mass distribution principle is that a family of measures \(\{\mu _\delta \}\) is used instead of a single measure. Since each measure \(\mu _\delta \) is only required to describe a range of scales, in practice one can often use finite sums of point masses. Whilst the measures \(\mu _\delta \) may vary, it is essential that they all assign mass at least \(a>0\) to F.

2.3 A Frostman type lemma for \(\underline{\mathrm{dim}}_{\,\theta }\)

Frostman’s lemma is another powerful tool in fractal geometry, which asserts the existence of measures of the type considered by the mass distribution principle, see [2, page 77] or [8, page 112]. The following analogue of Frostman’s lemma holds for intermediate dimensions and is a useful dual to Proposition 2.2.

Proposition 2.3

Let F be a compact subset of \({\mathbb {R}}^n\), let \(0< \theta \le 1\), and suppose \(0< s< \underline{\mathrm{dim}}_{\,\theta }F\). There exists a constant \(c >0\) such that for all \(\delta \in (0,1)\) we can find a Borel probability measure \(\mu _\delta \) supported on F such that for all \(x \in {\mathbb {R}}^n\) and \(\delta ^{1/\theta } \le r \le \delta \),

$$\begin{aligned} \mu _\delta (B(x,r)) \le c r^s . \end{aligned}$$

Moreover, \(\mu _\delta \) can be taken to be a finite collection of atoms.

Proof

This proof follows the proof of the classical version of Frostman’s lemma given in [8, pages 112–114]. For \(m \ge 0\) let \({\mathcal {D}}_m\) denote the familiar partition of \([0,1]^n\) consisting of \(2^{nm}\) pairwise disjoint half-open dyadic cubes of sidelength \(2^{-m}\), that is cubes of the form \([a_1, a_1+2^{-m}) \times \cdots \times [a_n, a_n+2^{-m})\). By translating and rescaling we may assume without loss of generality that \(F \subseteq [0,1]^n\) and that F is not contained in any \(Q \in {\mathcal {D}}_1\). It follows from the definition of \(\underline{\mathrm{dim}}_{\,\theta }F\) that there exists \(\varepsilon >0\) such that for all \(\delta \in (0,1)\) and for all covers \(\{U_i\}_i\) of F satisfying \(\delta ^{1/\theta } \le |U_i| \le \delta \),

$$\begin{aligned} \sum _i |U_i|^s > \varepsilon . \end{aligned}$$
(2.5)

Given \(\delta \in (0,1)\), let \(m \ge 0\) be the unique integer satisfying \(2^{-m-1}< \delta ^{1/\theta } \le 2^{-m}\) and let \(\mu _m\) be a measure defined on F as follows: for each \(Q \in {\mathcal {D}}_m\) such that \(Q \cap F \ne \emptyset \), then choose an arbitrary point \( x_Q \in Q \cap F\) and let

$$\begin{aligned} \mu _m = \sum _{ Q \in {\mathcal {D}}_m : Q \cap F \ne \emptyset } 2^{-ms} \delta _{x_Q} \end{aligned}$$

where \(\delta _{x_Q}\) is a point mass at \(x_Q\). Modify \(\mu _m\) to form a measure \(\mu _{m-1}\), supported on the same finite set, defined by

$$\begin{aligned} \mu _{m-1}\vert _Q = \min \{1, 2^{-(m-1)s} \mu _m(Q)^{-1}\} \mu _m\vert _Q \end{aligned}$$

for all \(Q \in {\mathcal {D}}_{m-1}\), where \(\nu \vert _E\) denotes the restriction of \(\nu \) to E. The purpose of this modification is to reduce the mass of cubes which carry too much measure. This is done since we are ultimately trying to construct a measure which we can estimate uniformly from above. Continuing inductively, \(\mu _{m-k-1}\) is obtained from \(\mu _{m-k}\) by

$$\begin{aligned} \mu _{m-k-1}\vert _Q = \min \{1, 2^{-(m-k-1)s} \mu _{m-k} (Q)^{-1}\} \mu _{m-k} \vert _Q \end{aligned}$$

for all \(Q \in {\mathcal {D}}_{m-k-1}\). We terminate this process when we define \(\mu _{m-l}\) where l is the largest integer satisfying \(2^{-(m-l)}n^{1/2} \le \delta \). (We may assume that \(l \ge 0\) by choosing \(\delta \) sufficiently small to begin with.) In particular, cubes \(Q \in {\mathcal {D}}_{m-l}\) satisfy \(|Q| = 2^{-(m-l)}n^{1/2} \le \delta \). By construction we have

$$\begin{aligned} \mu _{m-l}(Q) \le 2^{-(m-k)s} = |Q|^s n^{-s/2} \end{aligned}$$
(2.6)

for all \(k =0, \dots , l\) and \(Q \in {\mathcal {D}}_{m-k}\). Moreover, for all \(x \in F\), there is at least one \(k \in \{0, \dots , l\}\) and \(Q \in {\mathcal {D}}_{m-k}\) with \(x \in Q\) such that the inequality in (2.6) is an equality. This is because all cubes at level m satisfy the equality for \(\mu _m\) and if a cube Q satisfies the equality for \(\mu _{m-k}\), then either Q or its parent cube satisfies the equality for \(\mu _{m-k-1}\). For each \(x \in F\), choosing the largest such Q yields a finite collection of cubes \(Q_1, \dots , Q_t\) which cover F and satisfy \(\delta ^{1/\theta } \le |Q_i| \le \delta \) for \(i = 1, \dots , t\). Therefore, using (2.5),

$$\begin{aligned} \mu _{m-l}(F) = \sum _{i=1}^t \mu _{m-l}(Q_i) = \sum _{i=1}^t |Q_i|^s n^{-s/2} > \varepsilon n^{-s/2}. \end{aligned}$$

Let \(\mu _\delta = \mu _{m-l}(F)^{-1} \mu _{m-l}\), which is clearly a probability measure supported on a finite collection of points. Moreover, for all \(x \in {\mathbb {R}}^n\) and \(\delta ^{1/\theta } \le r \le \delta \), B(xr) is certainly contained in at most \(c_n\) cubes in \({\mathcal {D}}_{m-k}\) where k is chosen to be the largest integer satisfying \(0 \le k \le l\) and \(2^{-(m-k+1)} < r\), and \(c_n\) is a constant depending only on n. Therefore, using (2.6),

$$\begin{aligned} \mu _\delta (B(x,r)) \le c_n \mu _{m-l}(F)^{-1} 2^{-(m-k) s} \le c_n \varepsilon ^{-1} n^{s/2} 2^s r^s \end{aligned}$$

which completes the proof, setting \(c= c_n \varepsilon ^{-1} n^{s/2} 2^s \). \(\square \)

2.4 General bounds

Here we consider general bounds which rely on the Assouad dimension and which have interesting consequences for continuity. Namely, they provide numerous examples where the intermediate dimensions are discontinuous at \(\theta =0\) and also provide another proof that the intermediate dimensions are continuous at \(\theta = 1\). The Assouad dimension of \(F \subseteq {\mathbb {R}}^n\) is defined by

$$\begin{aligned} \dim _\text {A} F= & {} \inf \bigg \{ s \ge 0 \ : \ \text {there exists }C>0\text { such that for all }x \in F, \\&\qquad \qquad \text {and for all }0<r<R,\text { we have} \ N_r(F \cap B(x,R)) \le C \left( \frac{R}{r} \right) ^s \bigg \} \end{aligned}$$

where \(N_r(A)\) denotes the smallest number of sets of diameter at most r required to cover a set A. In general \({\underline{\dim }}_\text {B} F \le {\overline{\dim }}_\text {B} F \le \dim _\text {A} F \le n\), but equality of these three dimensions occurs in many cases, even if the Hausdorff dimension and box dimension are distinct, for example if the box dimension is equal to the ambient spatial dimension. See [4, 11] for more background on the Assouad dimension. The following proposition gives lower bounds for the intermediate dimensions in terms of Assouad dimensions.

Proposition 2.4

Given any non-empty bounded \(F \subseteq {\mathbb {R}}^n\) and \(\theta \in (0,1)\),

$$\begin{aligned} \underline{\mathrm{dim}}_{\,\theta }F \ge \dim _\text {A} F - \frac{\dim _\text {A} F - {\underline{\dim }}_\text {B} F}{\theta }, \end{aligned}$$

and

$$\begin{aligned} \overline{\mathrm{dim}}_{\,\theta }F \ge \dim _\text {A} F - \frac{\dim _\text {A} F - {\overline{\dim }}_\text {B} F}{\theta }. \end{aligned}$$

In particular, if \({\underline{\dim }}_\text {B} F = \dim _\text {A} F\), then \(\underline{\mathrm{dim}}_{\,\theta }F ={\overline{\dim }}_\theta F = {\underline{\dim }}_\text {B} F = \dim _\text {A} F\) for all \(\theta \in (0,1]\).

Proof

We will prove the lower bound for \(\underline{\mathrm{dim}}_{\,\theta }F\), the proof for \(\overline{\mathrm{dim}}_{\,\theta }F\) is similar. Fix \(\theta \in (0,1)\) and assume that \({\underline{\dim }}_\text {B} F >0\), otherwise the result is trivial. Let

$$\begin{aligned} 0<b< {\underline{\dim }}_\text {B} F \le \dim _\text {A} F< d < \infty \end{aligned}$$

and \(\delta \in (0,1)\) be given. By the definition of lower box dimension, there exists a uniform constant \(C_0\), depending only on F and b, such that there is a \(\delta \)-separated set of points in F of cardinality at least \(C_0 \delta ^{-b}\). Let \(\mu _\delta \) be a uniformly distributed probability measure on these points, i.e. a sum of \(C_0\delta ^{-b}\) point masses each with mass \(C_0^{-1} \delta ^{b}\). We use our mass distribution principle with this measure to prove the proposition.

Let \(U \subseteq {\mathbb {R}}^n\) be a Borel set with \(|U| = \delta ^\gamma \) for some \(\gamma \in [\theta , 1]\). By the definition of Assouad dimension there exists a uniform constant \(C_1\), depending only on F and d, such that U intersects at most \(C_1 ( \delta ^\gamma /\delta )^d\) points in the support of \(\mu _\delta \). Therefore

$$\begin{aligned} \mu _\delta (U) \le C_1\delta ^{(\gamma -1)d} C_0^{-1} \delta ^{b} = C_1C_0^{-1} \ |U|^{(\gamma d - d+ b)/ \gamma } \le C_1C_0^{-1} \ |U|^{(\theta d - d+ b)/ \theta } \end{aligned}$$

which, using Proposition 2.2, implies that

$$\begin{aligned} \underline{\mathrm{dim}}_{\,\theta }F \ge (\theta d - d+ b)/ \theta = d - \frac{d-b}{\theta }. \end{aligned}$$

Letting \(d \rightarrow \dim _\text {A} F\) and \(b \rightarrow {\underline{\dim }}_\text {B} F\) yields the desired result. \(\square \)

This proposition implies that for bounded sets with \(\dim _\text {H} F < {\underline{\dim }}_\text {B} F = \dim _\text {A} F\), the intermediate dimensions \(\underline{\mathrm{dim}}_{\,\theta }F\) and \(\overline{\mathrm{dim}}_{\,\theta }F\) are necessarily discontinuous at \(\theta = 0\). In fact the intermediate dimensions are constant on (0, 1] in this case. On the other hand, this gives an alternative demonstration that \(\underline{\mathrm{dim}}_{\,\theta }F\) and \(\overline{\mathrm{dim}}_{\,\theta }F\) are always continuous at \(\theta =1\). Moreover, the proposition provides a quantitative lower bound near \(\theta =1\). In Sect. 3.2 we will use Proposition 2.4 to construct examples exhibiting a range of behaviours.

2.5 Product formulae

A well-studied problem in dimension theory is how dimensions of product sets behave. The following product formulae for intermediate dimensions may be of interest in their own right, but in Sect. 3.2 they will be used to construct examples.

Proposition 2.5

Let \(E \subseteq {\mathbb {R}}^n\) and \(F \subseteq {\mathbb {R}}^m\) be bounded and \(\theta \in [0,1]\). Then

$$\begin{aligned} \underline{\mathrm{dim}}_{\,\theta }E + \underline{\mathrm{dim}}_{\,\theta }F\ \le \ \underline{\mathrm{dim}}_{\,\theta }(E \times F)\ \le \ {\overline{\dim }}_\theta ( E \times F)\ \le \ {\overline{\dim }}_\theta E + {\overline{\dim }}_\text {B} F. \end{aligned}$$

Proof

Fix \(\theta \in (0,1)\) throughout, noting that the cases when \(\theta =0,1\) are well-known, see [2, Chapter 7]. We begin by demonstrating the left-hand inequality. We may assume that \(\underline{\mathrm{dim}}_{\,\theta }E , \underline{\mathrm{dim}}_{\,\theta }F >0\) as otherwise the conclusion follows by monotonicity. Moreover, since EF are bounded we may assume they are compact since all the dimensions considered are unchanged under taking closure. Let \(0< s< \underline{\mathrm{dim}}_{\,\theta }E\) and \(0<t< \underline{\mathrm{dim}}_{\,\theta }F\). It follows from Proposition 2.3 that there exist constants \(C_s,C_t > 0\) such that for all \(\delta \in (0,1)\) there exist Borel probability measures \(\mu _\delta \) supported on E and \(\nu _\delta \) supported on F such that for all \(x \in {\mathbb {R}}^n\) and \(\delta ^{1/\theta } \le r \le \delta \),

$$\begin{aligned} \mu _\delta (B(x,r)) \le C_s r^s \quad \text {and} \quad \nu _\delta (B(x,r)) \le C_t r^t. \end{aligned}$$

Consider the product measure \(\mu _\delta \times \nu _\delta \) which is supported on \(E \times F\). For \(z \in {\mathbb {R}}^n\times {\mathbb {R}}^m\) and \(\delta ^{1/\theta } \le r \le \delta \),

$$\begin{aligned} (\mu _\delta \times \nu _\delta ) (B(z,r)) \le C_s C_t r^{s+t} \end{aligned}$$

and Proposition 2.2 yields \(\underline{\mathrm{dim}}_{\,\theta }(E \times F) \ge s+t\); letting \( s\rightarrow \underline{\mathrm{dim}}_{\,\theta }E\) and \(t\rightarrow \underline{\mathrm{dim}}_{\,\theta }F\) gives the desired inequality.

The middle inequality is trivial and so it remains to prove the right-hand inequality. Let \(s > {\overline{\dim }}_\theta E\) and \(d > {\overline{\dim }}_\text {B} F\). From the definition of \({\overline{\dim }}_\text {B} F\) there exists a constant \(\delta _1 \in (0,1)\) such that for all \(0<r<\delta _1\) there is a cover of F by at most \(r^{-d}\) sets of diameter r. Let \(\varepsilon >0\). By the definition of \({\overline{\dim }}_\theta E\) there exists \(\delta _0\in (0,\delta _1)\) such that for all \(0<\delta < \delta _0\) there is a cover of E by sets \(\{U_i\}_{i}\) with \(\delta ^{1/\theta } \le |U_i| \le \delta \) for all i and

$$\begin{aligned} \sum _{i} |U_i|^s \le \varepsilon . \end{aligned}$$

Given such a cover of E, for each i let \(\{U_{i,j}\}_{j}\) be a cover of F by at most \(|U_i|^{-d}\) sets with diameters \(|U_{i,j}|=|U_{i}|\) for all j. Then

$$\begin{aligned} E\times F\ \subseteq \ \bigcup _i \bigcup _j \big (U_i \times U_{i,j}\big ), \end{aligned}$$

with

$$\begin{aligned} \sum _i\sum _j |U_i \times U_{i,j}|^{s+d} \ \le \ \sum _i|U_i|^{-d}\big ( \sqrt{2}|U_i|\big )^{s+d} \ =\ 2^{(s+d)/2}\sum _i|U_i|^{s} \ \le \ 2^{(s+d)/2}\varepsilon . \end{aligned}$$

Since \(\delta ^{1/\theta } \le |U_i \times U_{i,j}| \le \sqrt{2}\delta \) for all ij, each set \(U_i \times U_{i,j}\) may be covered by at most \(c_{n+m}\) sets \(\{V_{i,j,k}\}_k\) with diameters \(\delta ^{1/\theta }\le |V_{i,j,k}| \le \min \{|U_i \times U_{i,j}|,\delta \}\le \delta \), where \(c_{q}\) is the least number such that every set in \({\mathbb {R}}^q\) of diameter \(\sqrt{2}\) can be covered by at most \(c_{q}\) sets of diameter 1. Hence

$$\begin{aligned} \sum _i\sum _j \sum _k|V_{i,j,k}|^{s+d}\ \le \ c_{n+m}2^{(s+d)/2}\varepsilon . \end{aligned}$$

As \(\varepsilon \) may be taken arbitrarily small, \({\overline{\dim }}_\theta (E \times F) \le s+d\); letting \( s\rightarrow \overline{\mathrm{dim}}_{\,\theta }E\) and \(d\rightarrow {\overline{\dim }}_\text {B} F\) completes the proof. \(\square \)

3 Examples

In this section we construct several simple examples where the intermediate dimensions exhibit a range of phenomena. All of our examples are compact subsets of \({\mathbb {R}}\) or \({\mathbb {R}}^2\) and in all examples the upper and the lower intermediate dimensions coincide for all \(\theta \in [0,1]\).

3.1 Convergent sequences

Let \(p>0\) and

$$\begin{aligned} F_p = \bigg \{0, \frac{1}{1^p}, \frac{1}{2^p},\frac{1}{3^p},\ldots \bigg \}. \end{aligned}$$

Since \(F_p\) is countable, \(\mathrm{dim}_\mathrm{H}\,F_p=0\). It is well-known that \(\mathrm{dim}_\mathrm{B}F_p=1/(p+1)\), see [2, Chapter 2]. We obtain the intermediate dimensions of \(F_p\).

Proposition 3.1

For \(p>0\) and \(0\le \theta \le 1\),

$$\begin{aligned} \underline{\mathrm{dim}}_{\,\theta }F_p = {\overline{\dim }}_\theta F_p = \frac{\theta }{p+\theta }. \end{aligned}$$

Proof

We first bound \({\overline{\dim }}_\theta F_p\) above. Let \(0<\delta <1\) and let \(M =\lceil \delta ^{-(s +\theta (1-s))/(p+1)}\rceil \). Write B(xr) for the closed interval (ball) of centre x and length 2r. Take a covering \({{\mathcal {U}}}\) of \(F_p\) consisting of M intervals \(B(k^{-p}, \delta /2)\) of length \(\delta \) for \(1\le k\le M\) and \(\lceil M^{-p} /\delta ^\theta \rceil \le M^{-p}/\delta ^\theta +1\) intervals of length \(\delta ^\theta \) that cover \([0, M^{-p}].\) Then

$$\begin{aligned} \sum _{U\in {{\mathcal {U}}}} |U|^s\le & {} M\delta ^s + \delta ^{\theta s}\left( \frac{1}{M^p\delta ^\theta }+ 1\right) \\= & {} M\delta ^s + \frac{\delta ^{\theta (s-1)}}{M^p}+ \delta ^{\theta s}\\\le & {} ( \delta ^{-(s +\theta (1-s))/(p+1)} +1) \delta ^s + \delta ^{\theta (s-1)}\delta ^{(s +\theta (1-s))p/(p+1)} +\delta ^{\theta s}\\= & {} 2\delta ^{(\theta (s-1)+sp)/(p+1)} + \delta ^s+ \delta ^{\theta s} \ \rightarrow \ 0 \end{aligned}$$

as \(\delta \rightarrow 0\) if \(s(\theta +p) > \theta \). Thus \({\overline{\dim }}_\theta F_p \le \theta /(p+\theta )\).

For the lower bound we put suitable measures on \(F_p\) and apply Proposition 2.2. Fix \(s = \theta /(p+\theta )\). Let \(0<\delta < 1\) and again let \(M =\lceil \delta ^{-(s +\theta (1-s))/(p+1)}\rceil \). Define \(\mu _\delta \) as the sum of point masses on the points \(1/k^p \ (1\le k<\infty )\) with

$$\begin{aligned} \mu _\delta \left( \left\{ \frac{1}{k^p}\right\} \right) \ = \ \left\{ \begin{array}{cl} \delta ^s &{} \hbox { if } \; 1\le k \le M \\ 0 &{} \hbox { if }\; M+1\le k <\infty \end{array} \right. . \end{aligned}$$
(3.1)

Then

$$\begin{aligned} \mu _\delta (F_p)= & {} M\delta ^s \\\ge & {} \delta ^{-(s +\theta (1-s))/(p+1)}\delta ^s \\= & {} \delta ^{(ps +\theta (s-1))/(p+1)} = 1 \end{aligned}$$

by the choice of s.

To see that (2.4) is satisfied, note that if \(2\le k\le M\) then, by a mean value theorem estimate,

$$\begin{aligned} \frac{1}{(k-1)^p} - \frac{1}{k^p}\ \ge \ \frac{p}{k^{p+1}}\ \ge \ \frac{p}{M^{p+1}}; \end{aligned}$$

thus the gap between any two points of \(F_p\) carrying mass is at least \(p/M^{p+1}\). Let U be such that \(\delta \le |U|\le \delta ^\theta \). Then U intersects at most \(1+ |U|/(p/M^{p+1}) = 1+|U|M^{p+1}/p \) of the points of \(F_p\) which have mass \(\delta ^s\). Hence

$$\begin{aligned} \mu _\delta (U)\le & {} \delta ^s + \frac{1}{p}|U|\delta ^s \delta ^{-(s +\theta (1-s))} \\= & {} \delta ^s + \frac{1}{p}|U| \delta ^{(\theta (s-1))}\\\le & {} |U|^s + \frac{1}{p} |U||U|^{s-1} \\= & {} \left( 1+ \frac{1}{p}\right) |U|^s. \end{aligned}$$

From Proposition  2.2, \(\underline{\mathrm{dim}}_{\,\theta }F_p \ge s = \theta /(p+\theta )\). \(\square \)

3.2 Simple examples exhibiting different phenomena

We use the convergent sequences \(F_p\) from the previous section, the product formulae from Proposition 2.5, and that the upper intermediate dimensions are finitely stable to construct examples displaying a range of different features which are illustrated in Fig. 1.

The natural question which began this investigation is ‘does \(\underline{\mathrm{dim}}_{\,\theta }\) vary continuously between the Hausdorff and lower box dimension?’. This indeed happens for the convergent sequences considered in the previous section, but turns out to be false in general. The first example in this direction is provided by another convergent sequence.

Fig. 1
figure 1

Plots of \(\underline{\mathrm{dim}}_{\,\theta }F\) for the four examples in Sect. 3.2

Example 1: Discontinuous at 0, otherwise constant. Let

$$\begin{aligned} F_{\log } = \{ 0, 1/\log 2, 1/\log 3, 1/\log 4, \dots \}. \end{aligned}$$

This sequence converges slower than any of the polynomial sequences \(F_p\) and it is well-known and easy to prove that \(\mathrm{dim}_\mathrm{B}F_{\log } = \dim _\text {A} F_{\log }= 1\). It follows from Proposition 2.4 that

$$\begin{aligned} \underline{\mathrm{dim}}_{\,\theta }F_{\log } = {\overline{\dim }}_\theta F_{\log } = 1,\quad \theta \in (0,1]. \end{aligned}$$

Since \(\dim _0 F_{\log }=\mathrm{dim}_\mathrm{H}\,F_{\log }=0\) there is a discontinuity at \(\theta = 0\).

Example 2: Continuous at 0, part constant, part strictly increasing. In the opposite direction, it is possible that \(\underline{\mathrm{dim}}_{\,\theta }F = \mathrm{dim}_\mathrm{H}\,F < \underline{\mathrm{dim}}_\mathrm{B}\,F\) for some \(\theta >0\). Indeed, let \(F =F_1 \cup E\) where \(F_1 = \{0, 1/1, 1/2, 1/3, \dots \}\) as before, and let \(E \subset {\mathbb {R}}\) be any compact set with \(\mathrm{dim}_\mathrm{H}\,E = \overline{\mathrm{dim}}_\mathrm{B}\,E = 1/3\) (for example an appropriately chosen self-similar set). Then it is straightforward to deduce that

$$\begin{aligned} \underline{\mathrm{dim}}_{\,\theta }F = {\overline{\dim }}_\theta F = \max \left\{ \frac{ \theta }{1+ \theta }, \, 1/3\right\} ,\quad \theta \in [0,1]. \end{aligned}$$

It is also possible for \(\underline{\mathrm{dim}}_{\,\theta }F\) to approach a value strictly in between \(\mathrm{dim}_\mathrm{H}\,F\) and \(\underline{\mathrm{dim}}_\mathrm{B}\,F\) as \(\theta \rightarrow 0\). This is the subject of the next two examples.

Example 3: Discontinuous at 0, part constant, part strictly increasing. For an example that is constant on an interval adjacent to 0, let \(F =E \cup F_{1}\) where this time \(E\subset {\mathbb {R}}\) is any closed countable set with \(\underline{\mathrm{dim}}_\mathrm{B}\,E = \dim _\text {A} E = 1/4\). It is immediate that

$$\begin{aligned} \underline{\mathrm{dim}}_{\,\theta }F = {\overline{\dim }}_\theta F = \max \left\{ \frac{ \theta }{1+ \theta }, \, 1/4\right\} ,\quad \theta \in (0,1], \end{aligned}$$

with \(\mathrm{dim}_\mathrm{H}\,F=0\) and \(\mathrm{dim}_\mathrm{B}F=1/2\).

Example 4: Discontinuous at 0, strictly increasing. Finally, for an example where \(\underline{\mathrm{dim}}_{\,\theta }F\) is smooth, strictly increasing but not continuous at \(\theta =0\), let

$$\begin{aligned} F = F_{1} \times F_{\log }\subset {\mathbb {R}}^2. \end{aligned}$$

Here \(\mathrm{dim}_\mathrm{H}\,F=0\) and \(\mathrm{dim}_\mathrm{B}F=3/2\) and Proposition 2.5 gives

$$\begin{aligned} \underline{\mathrm{dim}}_{\,\theta }F = {\overline{\dim }}_\theta F = \frac{ \theta }{1 + \theta }+ 1,\quad \theta \in (0,1], \end{aligned}$$

noting that \(\underline{\mathrm{dim}}_{\,\theta }F_{\log } = {\overline{\dim }}_\theta F_{\log } = \underline{\mathrm{dim}}_\mathrm{B}\,F_{\log } = \overline{\mathrm{dim}}_\mathrm{B}\,F_{\log } = \dim _\text {A} F_{\log }= 1\) for \(\theta \in (0,1]\).

4 Bedford–McMullen carpets

A well-known class of fractals where the Hausdorff and box dimensions differ are the self-affine carpets; this is a consequence of the alignment of the component rectangles in the iterated construction. The first studies of planar self-affine carpets were by Bedford [1] and McMullen [9] independently, see also [10] and these Bedford–McMullen carpets have been widely studied and generalised, see for example [3] and references therein. Indeed, the question of the distribution of scales of covering sets for Hausdorff and box dimensions of Bedford–McMullen carpets was one of our motivations for studying intermediate dimensions. Finding an exact formula for the intermediate dimensions of Bedford–McMullen carpets seems a difficult problem, so here we obtain some lower and upper bounds, which in particular establish continuity at \(\theta =0\) and that the intermediate dimensions take a strict minimum at \(\theta =0\).

We introduce the notation we need, generally following that of McMullen [9]. Choose integers \(n > m \ge 2\). Let \({I}=\left\{ 0,\ldots , m-1 \right\} \) and \({J}=\left\{ 0,\ldots , n-1 \right\} \). Choose a fixed digit set \(D \subseteq I \times J\) with at least two elements. For \(( p,q )\in D\) we define the affine contraction \(S_{(p,q )}:[0,1]^2 \rightarrow [0,1]^2\) by

$$\begin{aligned} S_{ \left( p,q \right) }\left( x,y\right) = \left( \frac{x+p}{m},\frac{y+q}{n} \right) . \end{aligned}$$

There exists a unique non-empty compact set \(F \subseteq [0,1]^2\) satisfying

$$\begin{aligned} F=\bigcup _{(p,q) \in D}S_{ ( p,q )}(F); \end{aligned}$$

that is F is the attractor of the iterated function system \(\{ S_{ (p,q)} \}_{ ( p,q )\in D}\). We call such a set F a Bedford–McMullen self-affine carpet. It is sometimes convenient to denote pairs in D by \(\ell = (p_\ell ,q_\ell )\).

We model our carpet F via the symbolic space \(D^{{\mathbb {N}}}\), which consists of all infinite words over D and is equipped with the product topology. We write \({\mathbf {i}} \equiv (i_1,i_2,\ldots )\) for elements of \(D^{{\mathbb {N}}}\) and \((i_1,\ldots ,i_k)\) for words of length k in \(D^k\), where \(i_j \in D\). Then the canonical projection \(\tau :D^{{\mathbb {N}}} \rightarrow [0,1]^2\) is defined by

$$\begin{aligned} \{\tau ({\mathbf {i}})\}\equiv \{\tau (i_1,i_2,\ldots )\}=\bigcap _{k \in {\mathbb {N}}} S_{i_1} \circ \cdots \circ S_{i_k}([0,1]^2). \end{aligned}$$

This allows us to switch between symbolic and geometric notation since

$$\begin{aligned} \tau (D^{{\mathbb {N}}})=F. \end{aligned}$$

Bedford [1] and McMullen [9] showed that

$$\begin{aligned} \mathrm{dim}_\mathrm{B}F = \frac{\log m_0}{\log m} + \frac{\log |D| - \log m_0}{\log n} \end{aligned}$$
(4.1)

where \(m_0\) is the number of p such that there is a q with \((p,q) \in D\), that is the number of columns of the array containing at least one rectangle. They also showed that

$$\begin{aligned} \mathrm{dim}_\mathrm{H}\,F = \log \left( \sum _{p=1}^m n_p^{\log _n m} \right) \Big / \log m, \end{aligned}$$
(4.2)

where \(n_p \, (1\le p\le m)\) is the number of q such that \((p,q)\in D\), that is the number of selected rectangles in the \(p\hbox {th}\) column of the array.

For each \(\ell \in D\) we let \(a_\ell \) be the number of rectangles of D in the same column as \(\ell \). Then, writing \(d = \mathrm{dim}_\mathrm{H}\,F\), (4.2) may be written as

$$\begin{aligned} m^d \ = \ \sum _{p=1}^m {n}_p^{\log _n m} = \sum _{\ell \in D} a_{\ell }^{(\log _n m -1)}, \end{aligned}$$
(4.3)

where equality of the sums follows from the definitions of \(n_p\) and \(a_\ell \).

We assume that the non-zero \(n_p\) are not all equal, otherwise \(\mathrm{dim}_\mathrm{H}\,F = \mathrm{dim}_\mathrm{B}F\); in particular this implies that \(a := \max _{\ell \in D}a_\ell \ge 2\).

We denote the \(k\hbox {th}\)-level iterated rectangles by

$$\begin{aligned} R_k (i_1,\ldots ,i_k)\ =\ S_{i_1}\circ \cdots \circ S_{i_k}([0,1]^2). \end{aligned}$$

We also write \(R_k({\mathbf {i}}) \equiv R_k(i_1,i_2, \ldots )\) for this rectangle when we wish to indicate the \(k\hbox {th}\)-level iterated rectangle containing the point \(\tau ({\mathbf {i}}) = \tau (i_1,i_2, \ldots )\).

We will associate a probability vector \(\ \left\{ b_{\ell } \right\} _{\ell \in D}\) with D and let \({\widetilde{\mu }}=\prod _{k\in {\mathbb {N}}}( \sum _{ i_k\in D}b_{ i_k}\delta _{ i_k})\) be the natural Borel product probability measure on \(D^{{\mathbb {N}}}\), where \(\delta _{ \ell }\) is the Dirac measure on D concentrated at \( \ell \). Then the measure

$$\begin{aligned} \mu ={\widetilde{\mu }}\circ \tau ^{-1} \end{aligned}$$

is a self-affine measure supported on F. Following McMullen [9] we set

$$\begin{aligned} b_\ell = a_\ell ^{(\log _n m -1)}/m^d\quad (\ell \in D), \end{aligned}$$
(4.4)

noting that \(\sum _{\ell \in D} b_\ell =1\), to get a measure \(\mu \) on F; thus the measures of the iterated rectangles are

$$\begin{aligned} \mu \big ( R_k (i_1,\ldots ,i_k)\big )\ =\ b_{i_1}\ldots b_{i_k}\ =\ m^{-kd} ( a_{i_1} \ldots a_{i_k})^{(\log _n m -1)}. \end{aligned}$$
(4.5)

Approximate squares are well-known tools in the study of self-affine carpets. Given \(k\in {\mathbb {N}}\) let \(l(k) = \lfloor k \log _n m\rfloor \) so that

$$\begin{aligned} k\log _n m \ \le \ l(k) \ \le k\log _n m \ + \ 1 \end{aligned}$$
(4.6)

For such k and \({\mathbf {i}} = (i_1,i_2, \ldots ) \in D^{{\mathbb {N}}}\) we define the approximate square containing \(\tau ({\mathbf {i}})\) as the union of \(m^{-k}\times n^{-k}\) rectangles:

$$\begin{aligned} Q_k({\mathbf {i}}) = Q_k(i_1,i_2, \ldots ) = \bigcup \Big \{&R_k(i_1',\ldots ,i_k'): p_{i_{j}'}=p_{i_{j}} \text { for } j= 1, \ldots , k \\&\text { and } q_{i_{j}'}=q_{i_{j}} \text { for } j= 1, \ldots , l(k) \ \Big \}, \end{aligned}$$

recalling that \(\ell = (p_\ell ,q_\ell )\). This approximate square has sides \(m^{-k}\times n^{-l(k)}\) where \(n^{-1}m^{-k} \le n^{-l(k)}\le m^{-k}\).

Note that, by virtue of self-affinity and since the sequence \((p_{i_1},\ldots p_{i_k})\) is the same for all level-k rectangles \(R_k (i_1,\ldots ,i_k)\) in the same approximate square, the \(a_{i_{l(k)+1}}\ldots a_{i_k}\) level-k rectangles that comprise the approximate square \(Q_k({\mathbf {i}})\) all have equal \(\mu \)-measure. Thus, writing \(L= \log _n m\),

$$\begin{aligned} \mu (Q_k({\mathbf {i}}))= & {} m^{-kd} a_{i_1}^{(L -1)}\ldots a_{i_k}^{(L -1)} \times a_{i_{l(k)+1}}\ldots a_{i_k} \end{aligned}$$
(4.7)
$$\begin{aligned}= & {} m^{-kd} a_{i_1}^{L}\ldots a_{i_{k}}^{L} \times a_{i_{1}}^{-1}\ldots a_{i_{l(k)}}^{-1}. \end{aligned}$$
(4.8)

We now obtain an upper bound for \({\overline{\dim }}_\theta F\) which implies continuity at \(\theta = 0\) and so on [0, 1]. Recall that \(a = \max _{\ell \in D}a_\ell \ge 2\).

Proposition 4.1

Let F be the Bedford–McMullen carpet as above. Then for \(0<\theta <\frac{1}{4}(\log _n m)^2\),

$$\begin{aligned} {\overline{\dim }}_\theta F \ \le \ \mathrm{dim}_\mathrm{H}\,F + \bigg (\frac{2\log (\log _m n)\log a}{\log n}\bigg ) \frac{1}{-\log \theta }. \end{aligned}$$
(4.9)

In particular, \( {\underline{\dim }}_\theta F\) and \( {\overline{\dim }}_\theta F\) are continuous at \(\theta = 0\) and so are continuous on [0, 1].

Proof

For \({\mathbf {i}} = (i_1,i_2,\ldots )\), rewriting (4.8) gives

$$\begin{aligned} \mu \big (Q_k({\mathbf {i}})\big ) = m^{-kd} \bigg (\frac{ (a_{i_1}\ldots a_{i_k})^{1/k}}{(a_{i_1}\ldots a_{i_{l(k)}})^{1/l(k)}}\bigg )^{Lk} \big (a_{i_1}\ldots a_{i_{l(k)}}\big )^{(kL/l(k))-1}. \end{aligned}$$
(4.10)

We consider the two bracketed terms on the right-hand side of (4.10) in turn. We show that the first term cannot be too small for too many consecutive k and that the second term is bounded below by 1.

For \(k>1\) with \(l(k) =\lfloor k \log _n m\rfloor \) as usual, define

$$\begin{aligned} f_k({\mathbf {i}})\ \equiv \ f_k(i_1,i_2,\ldots ) \ =\ \bigg (\frac{ (a_{i_1}\ldots a_{i_k})^{1/k}}{(a_{i_1}\ldots a_{i_{l(k)}})^{1/l(k)}}\bigg ) \end{aligned}$$
(4.11)

We claim that for all \(K\ge L/(1-L)\) and all \({\mathbf {i}} = (i_1,i_2,\ldots ) \in D^{{\mathbb {N}}}\), there exists k with \(K\le k \le K/ \theta \) such that

$$\begin{aligned} f_k({\mathbf {i}})\ \ge \ a^{\log L/\log (L/2\theta )}. \end{aligned}$$
(4.12)

Suppose this is false for some \((i_1,i_2,\ldots )\) and \(K\ge 1/L(L-1)\), so for all \(K\le k \le K/ \theta \), \( f_k({\mathbf {i}}) < \lambda := a^{\log L/\log (L/2\theta )}\), that is

$$\begin{aligned} (a_{i_1} a_{i_2}\ldots a_{i_k})^{1/k}\ <\ \lambda (a_{i_1} a_{i_2}\ldots a_{i_l})^{1/l(k)}. \end{aligned}$$
(4.13)

Define a sequence of integers \(k_{r}\, (r=0,1,2,\ldots )\) inductively by \(k_{0} = K\) and for \(r \ge 1\) taking \(k_{r}\) to be the least integer such that \( \lfloor k_{r} \log _n m \rfloor = k_{r-1}\). Then \(k_{r} \le k_{r-1}/L +1\), and a simple induction shows that

$$\begin{aligned} k_{r}\ \le \ L^{-r}\big (K + L/(1-L)\big )-L/(1-L) \ \le \ L^{-r}\big (K + L/(1-L)\big )\quad (r\ge 0). \end{aligned}$$

Fix N to be the greatest integer such that \(k_N \le K/\theta \). Then

$$\begin{aligned} L^{-(N+1)} \big (K+L\big /(1 -L)\big )\ \ge \ k_{N+1}\ >\ K/\theta \end{aligned}$$

so rearranging, provided \(K\ge L/(1-L)\),

$$\begin{aligned} L^N\ <\ \frac{\theta \big (K + L\big /(1-L)\big )}{LK}\ \le \ \frac{2\theta }{L}, \end{aligned}$$

that is

$$\begin{aligned} N > \frac{\log (2\theta /L)}{\log L}. \end{aligned}$$
(4.14)

From (4.13)

$$\begin{aligned} (a_{i_1} a_{i_2}\ldots a_{i_{k_r}})^{1/{k_r}}\ <\ \lambda (a_{i_1} a_{i_2}\ldots a_{i_{k_{r-1}}})^{1/k_{r-1}} \quad (1\le r\le N) \end{aligned}$$

so iterating

$$\begin{aligned} 1\ \le \ (a_{i_1} a_{i_2}\ldots a_{i_{k_N}})^{1/k_N}\ <\ \lambda ^{N} (a_{i_1} a_{i_2}\ldots a_{i_{K}})^{1/K}\ \le \ \lambda ^{N}a. \end{aligned}$$

Combining with (4.14) we obtain

$$\begin{aligned} \lambda \ >\ a^{-1/N} \ge a^{\log L/\log (L/2\theta )} \end{aligned}$$

which contradicts the definition of \(\lambda \). Thus the claim (4.12) is established.

For the second bracket on the right-hand side of (4.10) note that

$$\begin{aligned} 0\ \le \ (kL/l(k))-1\ =\ \frac{ k\log _n m -\lfloor k\log _n m \rfloor }{l(k)}\ \le \frac{1}{l(k)} \end{aligned}$$

so that

$$\begin{aligned} 1\ \le \ \big (a_{i_1}\ldots a_{i_{l(k)}}\big )^{(kL/l(k))-1}\ \le a. \end{aligned}$$
(4.15)

Putting together (4.10), (4.12) and (4.15), we conclude that for all \(K\ge L/(1-L)\) and all \({\mathbf {i}} \in D^{{\mathbb {N}}}\) there exists \( K\le k \le K/ \theta \) such that

$$\begin{aligned} \mu \big (Q_k({\mathbf {i}})\big ) \ \ge \ m^{-dk} a^{kL\log L/\log (L/2\theta )} \ \ge \ m^{-dk} a^{2kL\log L/\log (1/\theta )} \ =\ m^{-k(d+\varepsilon (\theta ))}, \end{aligned}$$

as \(\theta \le L^2/4\), where

$$\begin{aligned} \varepsilon (\theta )\ =\ \frac{ -(\log a)\, 2L\log L}{\log m \log (1/\theta )} \ =\ \frac{2 \log (\log _m n)\log a}{\log n} \frac{1}{-\log \theta }. \end{aligned}$$

Geometrically this means that for \(K\ge L/(1-L)\) every point \(z \in F\) belongs to at least one approximate square, \(Q_{k(z)}\) say, with \(K \le {k(z)} \le K/\theta \) and with \(\mu (Q_{k(z)}) \ge m^{-k(d+\varepsilon (\theta ))}\). Since the approximate squares form a nested hierarchy we may choose a subset \(\{Q_{k(z_n)}\}_{n=1}^N \subset \{ Q_{k(z)}: z\in F\}\) that is disjoint (except possibly at boundaries of approximate squares) and which cover F. Thus

$$\begin{aligned} 1\ =\ \mu (F)\ = \ \sum _{n=1}^N \mu (Q_{k(z_n)})\ \ge \ \sum _{n=1}^N m^{-k(z_n)(d+\varepsilon (\theta ))} \ \ge \ \sum _{n=1}^N (2^{-1/2}|Q_{k(z_n)}|)^{(d+\varepsilon (\theta ))} \end{aligned}$$

where \(| Q_k|\) denotes the diameter of the approximate square \(Q_k\), noting that \(| Q_k|\le 2^{1/2} m^{-k}\). It follows that \( {\overline{\dim }}_\theta F \le d+\varepsilon (\theta )\) as claimed. \(\square \)

The following lemma brings together some basic estimates that we will need to obtain a lower bound for the intermediate dimensions of F.

Lemma 4.2

Let \(\varepsilon >0\). There exists \(K_0\in {\mathbb {N}}\) and a set \(E\subset F\) with \(\mu (E) \ge \frac{1}{2}\) such that for all \({\mathbf {i}}\) with \(\tau ({\mathbf {i}})\in E\) and \(k\ge K_0\),

$$\begin{aligned} \mu (Q_k({\mathbf {i}}))\le m^{-k(d-\varepsilon )} \end{aligned}$$
(4.16)

and

$$\begin{aligned} \mu (R_k({\mathbf {i}}))\ge \exp (-k(H(\mu )+\varepsilon )), \end{aligned}$$
(4.17)

where \(d=\dim _\text {H} F\) and \(H(\mu ) \in (0,\log |D|)\) is the entropy of the measure \(\mu \).

Proof

McMullen [9, Lemmas 3,4(a)] shows that for \({\widetilde{\mu }}\)-almost all \({\mathbf {i}} \in D^{{\mathbb {N}}}\)

$$\begin{aligned} \lim _{k\rightarrow \infty } \mu (Q_k({\mathbf {i}}))^{1/k} \rightarrow m^{-d}. \end{aligned}$$

Thus by Egorov’s theorem we may find a set \({\widetilde{E}}_1\subset D^{{\mathbb {N}}}\) with \({\widetilde{\mu }}({\widetilde{E}}_1) \ge \frac{3}{4}\), and \(K_1\in {\mathbb {N}}\) such that (4.16) holds for all \({\mathbf {i}} \in {\widetilde{E}}_1\) and \(k\ge K_1\).

Furthermore, it is immediate from the Shannon–MacMillan–Breimann Theorem and (4.5) that for \({\widetilde{\mu }}\)-almost all \({\mathbf {i}} \in D^{{\mathbb {N}}}\),

$$\begin{aligned} \lim _{k\rightarrow \infty } \mu (R_k({\mathbf {i}})))^{1/k} \rightarrow \exp (-H(\mu )), \end{aligned}$$

and again by Egorov’s theorem there is a set \({\widetilde{E}}_2\subset D^{{\mathbb {N}}}\) with \({\widetilde{\mu }}({\widetilde{E}}_2) \ge \frac{3}{4}\), and \(K_2\in {\mathbb {N}}\) such that (4.17) holds for all \({\mathbf {i}} \in {\widetilde{E}}_2\) and \(k\ge K_2\). The conclusion of the lemma follows taking \(E =\tau ({\widetilde{E}}_1\cap {\widetilde{E}}_2)\) and \(K_0 = \max \{K_1,K_2\}\). \(\square \)

We now obtain a lower bound for \({\underline{\dim }}_{\theta }F\), showing in particular that \({\underline{\dim }}_{\theta }F> \mathrm{dim}_\mathrm{H}\,F\) for all \(\theta >0\).

Proposition 4.3

Let F be the Bedford–McMullen carpet as above. Then for \(0\le \theta \le \log _n m\),

$$\begin{aligned} {\underline{\dim }}_{\theta }F\ge \mathrm{dim}_\mathrm{H}\,F+\theta \frac{\log |D|-H(\mu )}{\log m}. \end{aligned}$$
(4.18)

Proof

Fix \(\theta \in (0,1)\), let \(E\subset F\) and \(K_0\) be given by Lemma 4.2, and let \(K\ge K_0\). We define a measure \(\nu _K\) which assigns equal mass to all level-K rectangles, and then subdivides this mass among sub-rectangles using the same weights as for the measure \(\mu \), given by (4.4). This gives a measure to which we can apply the mass distribution principle, Lemma 2.2. Thus for \(k\ge K\), writing \(b_\ell = a_\ell ^{(\log _n m -1)}/m^d\) as in (4.4),

$$\begin{aligned} \nu _K\big ( R_k (i_1,\ldots ,i_k)\big ) := |D|^{-K}b_{i_{K+1}}\ldots b_{i_k}= |D|^{-K} m^{-(k-K)d} ( a_{i_{K+1}} \ldots a_{i_k})^{L-1}.\quad \end{aligned}$$
(4.19)

We now consider an approximate square \(Q_k({\mathbf {i}})\) containing the point \({\mathbf {i}}\). This approximate square is a union of rectangles \(R_k({\mathbf {j}})\) which, as explained in our comment before (4.7), each have the same \(\mu \) measure equal to \(\mu (R_k({\mathbf {i}}))\). The same argument gives that for any \(R_k({\mathbf {j}})\subset Q_k({\mathbf {i}})\)

$$\begin{aligned} \nu _K(R_k({\mathbf {j}}))=\nu _K(R_k({\mathbf {i}}))=\mu (R_k({\mathbf {i}})) \dfrac{|D|^{-K}}{\mu (R_K({\mathbf {i}}))} \end{aligned}$$

where the final equality holds since the formula for \(\nu _{K}\) differs from that of \(\mu \) only in the mass it assigns according to the first K letters. Putting this together allows one to express the \(\nu _{K}\)-measure of an approximate square of side length \(m^{-k}\) in relation to the \(\mu \)-measure of such a square. For \(\tau ({\mathbf {i}})\in E\) and \(k\ge K\), the approximate square \(Q_k({\mathbf {i}})\) that contains the point \({\mathbf {i}}\) has \(\nu _K\)-measure

$$\begin{aligned} \nu _{K}(Q_k({\mathbf {i}}))= & {} \dfrac{|D|^{-K}}{\mu (R_K({\mathbf {i}}))}\mu (Q_k({\mathbf {i}}))\nonumber \\\le & {} \dfrac{|D|^{-K} m^{-k(d-\varepsilon )}}{\exp (-K(H(\mu )+\varepsilon ))}, \end{aligned}$$
(4.20)

using Lemma 4.2. (Alternatively (4.20) may be verified directly using (4.5), (4.8) and (4.19).) Then

$$\begin{aligned} \nu _{K}(Q_k({\mathbf {i}}))\le & {} m^{-k(d-\varepsilon ) -K \log _m \big (|D|\exp (-H(\mu )-\varepsilon )\big )}\nonumber \\\le & {} m^{-k\big (d-\varepsilon +\frac{K}{k}(\log |D|-H(\mu )-\varepsilon )/\log m\big )}. \end{aligned}$$
(4.21)

We need bounds that are valid for all \(k\in [K,K/\theta ]\) corresponding to approximate squares of sides between (approximately) \(m^{-K}\) and \(m^{-K/\theta }\). The exponent in (4.21) is maximised when \(k = K/\theta \), so that

$$\begin{aligned} \nu _{K}(Q_k({\mathbf {i}}))\ \le \ m^{-k\big (d-\varepsilon +\theta (\log |D|-H(\mu )-\varepsilon )/\log m\big )} \end{aligned}$$
(4.22)

for all \({\mathbf {i}}\) with \(\tau ({\mathbf {i}})\in E\) and integers \(k\in [K,K/\theta ]\), where \(\mu (E) \ge \frac{1}{2}\).

To use our mass distribution principle we need equation (4.22) to hold on a set of \({\mathbf {i}}\) of large \(\nu _K\) mass, whereas currently we have that it holds on a set E of large \(\mu \) mass. Let

Firstly we observe that \(Q_k(\mathbf {i})\) depends only on \((i_1,\ldots i_k)\), and we are dealing with \(k\le K/\theta \), so the question of whether \(\tau (\mathbf {i})\in E'\) is independent of \((i_{\lfloor K/\theta \rfloor +1}, i_{\lfloor K/\theta \rfloor +2},\ldots )\). Secondly, \(Q_k(\mathbf {i})\) is a union of rectangles \(R_k(\mathbf {i})\), where \(\nu _K(R_k(\mathbf {i}))\) is independent of \((i_1, \ldots ,i_K)\). Furthermore, since \(k\ge K(\log n/\log m)\), the number of rectangles \(R_k(\mathbf {i}) \in Q_k(\mathbf {i})\) is independent of \((i_1, \ldots , i_K)\). Thus the question of whether \(\tau (\mathbf {i})\in E'\) is independent of \((i_1,\ldots ,i_K)\). Thus we can write

$$\begin{aligned} E'=\bigcup _{i_{K+1}\ldots i_{\lfloor K/\theta \rfloor }\in I''} \left( \bigcup _{i_1\ldots i_K\in D^K} R_{\lfloor K/\theta \rfloor }(i_1,\ldots ,i_{\lfloor K/\theta \rfloor })\right) \end{aligned}$$

for some set \(I''\subset D^{\lfloor K/\theta \rfloor -K}\). But using (4.19) gives

$$\begin{aligned} \nu _K \left( \bigcup _{i_1\ldots i_K\in D^K} R_{\lfloor K/\theta \rfloor }(i_1,\ldots , i_{\lfloor K/\theta \rfloor })\right)= & {} \sum _{i_1\ldots i_K\in D^K}\frac{1}{|D|^K}b_{i_{K+1}}\ldots b_{i_{\lfloor K/\theta \rfloor }}\\= & {} b_{i_{K+1}}\ldots b_{i_{\lfloor K/\theta \rfloor }} \end{aligned}$$

and (4.5) gives

$$\begin{aligned} \mu \left( \bigcup _{i_1\ldots i_K\in D^K} R_{\lfloor K/\theta \rfloor }(i_1,\ldots , i_{\lfloor K/\theta \rfloor })\right)= & {} \sum _{i_1\ldots i_K\in D^K}b_{i_1}\ldots b_{i_K}b_{i_{K+1}}\ldots b_{i_{\lfloor K/\theta \rfloor }}\\= & {} \bigg (\sum _{i_1\ldots i_K\in D^K}b_{i_1}\ldots b_{i_K}\bigg )b_{i_{K+1}}\ldots b_{i_{\lfloor K/\theta \rfloor }}\\= & {} b_{i_{K+1}}\ldots b_{i_{\lfloor K/\theta \rfloor }}, \end{aligned}$$

as \(\sum _{i\in D} b_i =1\). Since these quantities are equal we conclude that

$$\begin{aligned} \nu _K(E')\ =\ \mu (E')\ \ge \ \mu (E)\ \ge \ {\textstyle \frac{1}{2}} \end{aligned}$$

as required.

Since (4.22) holds for all \({\mathbf {i}} \in E'\), a straightforward variant on our mass distribution principle, where we use approximate squares instead of balls, gives

$$\begin{aligned} {\underline{\dim }}_{\theta }F\ \ge \ \mathrm{dim}_\mathrm{H}\,F-\varepsilon +\theta \frac{\log |D|-H(\mu )-\varepsilon }{\log m}. \end{aligned}$$

Since \(\varepsilon \) can be chosen arbitrarily small, (4.18) follows. \(\square \)

Note that, in (4.18),

$$\begin{aligned} H(\mu )\ =\ -m^{-d}\sum _{\ell \in D }(a_\ell ^{L-1}\big ((L-1)\log a_\ell - d\log m\big )\ \le \ \log | D| \end{aligned}$$

with equality if and only if \(\mu \) gives equal mass to all cylinders of the same length, which happens if and only if each column in our construction contains the same number of rectangles. This happens exactly when the box and Hausdorff dimension coincide. Thus our lower bounds give that \({\underline{\dim }}_{\theta }F> \mathrm{dim}_\mathrm{H}\,F\) whenever \(\theta >0\), provided that the Hausdorff and box dimensions of F are different.

Since the measures that we have constructed to give lower bounds on \({\underline{\dim }}_{\theta }F\) are rather crude, it is unlikely that our lower bound for \({\underline{\dim }}_{\theta }F\) converges to \({\dim }_\text {B} F\) as \(\theta \rightarrow 1\). However, a lower bound which does approach \({\dim }_\text {B} F\) as \(\theta \rightarrow 1\) is given by Proposition 2.4, noting that \(\dim _\text {A} F > {\underline{\dim }}_\text {B} F = {\dim }_\text {B} F\) provided \({\dim }_\text {B} F > {\dim }_\text {H} F\), see [4, 7].

Many questions on the intermediate dimensions of Bedford–McMullen carpets remain, most notably finding the exact forms of \({\underline{\dim }}_{\theta }F\) and \({\overline{\dim }}_{\theta }F\). In that direction we would at least conjecture that these intermediate dimensions are equal and strictly monotonic. One might hope to get better estimates using alternative definitions of \(\mu \) in Proposition 4.1 and \(\nu _K\) in Proposition 4.3, but McMullen’s measure and our modifications seemed to work best when optimising mass distribution type estimates across F and over the required range of scales.