1 Introduction

Model theory investigates the relationship between formal languages and semantics. From this perspective, among the most important results are the so called preservationFootnote 1 or characterisation theorems, linking the syntactic shape of formulas to some semantic property. Typically, these results characterise a certain language as the fragment of another, richer language consisting of those formulas that satisfy the given model-theoretic property. In the case of classical first-order logic, notable examples are the Łoś–Tarski theorem, stating that a first-order formula is equivalent to a universal one if and only if the class of its models is closed under taking submodels, and Lyndon’s theorem, stating that a first-order formula is equivalent to one for which each occurrence of a relation symbol R is positive if and only if it is monotone with respect to the interpretation of R (see e.g. [17]).

The aim of this paper is to show that similar results also hold for the predicate logic \(\texttt {M} \texttt {E} ^{\infty }\) that allows only monadic predicate symbols and no function symbols, but that goes beyond standard first-order logic with equality in that it features the generalised quantifier ‘there are infinitely many’.

Generalised quantifiers were introduced by Mostowski in [24], and in a more general sense by Lindström in [21], the main motivation being the observation that standard first-order quantifiers ‘there are some’ and ‘for all’ are not sufficient for expressing some basic mathematical concepts. Since then, they have attracted a lot of interest, insomuch that their study constitutes nowadays a well-established field of logic with important ramifications in disciplines such as linguistics and computer science.Footnote 2

Despite the fact that the absence of polyadic predicates clearly restricts its expressive power, monadic first-order logic (with identity) displays nice properties, both from a computational and a model-theoretic point of view. Indeed, the satisfiability problem becomes decidable [4, 22], and, in addition to an immediate application of Łoś–Tarski and Lyndon’s theorems, one can also obtain a Lindström like characterisation result [26]. Moreover, adding the possibility of quantifying over predicates does not increase the expressiveness of the language [2], meaning that when restricted to monadic predicates, monadic second-order logic collapses into first-order logic.

Concerning monadic first-order logic extended with an infinity quantifier, Mostowski [24] already proved a decidability result, whereas from work of Väänänen [27] we know that its expressive power coincides with that of weak monadic second-order logic restricted to monadic predicates, that is monadic first-order logic extended with a second-order quantifier ranging over finite sets.Footnote 3

1.1 Characterisation results and proof outline

A characterisation result involves some fragment \(\texttt {L} _{\mathfrak {P}}\) of a given yardstick logic \(\texttt {L} \), related to a certain semantic property \(\mathfrak {P}\). It is usually formulated as

$$\begin{aligned} \varphi \in \texttt {L} \text { has the property } \mathfrak {P} \text { iff } \varphi \text { is equivalent to some } \varphi ' \in \texttt {L} _{\mathfrak {P}}. \end{aligned}$$
(1)

In this work, our main yardstick logic will be \(\texttt {M} \texttt {E} ^{\infty }\). Table 1 summarises the semantic properties (\(\mathfrak {P}\)) we are going to consider, the corresponding expressively complete fragment (\(\texttt {L} _{\mathfrak {P}}\)) and the actual characterisation theorem.

Table 1 A summary of our characterisation theorems

The proof of each characterisation theorem is composed of two parts. The first, simpler one concerns the claim that each sentence in the fragment satisfies the concerned property. It is usually proved by a straightforward induction on the structure of the formula. The other direction is the expressive completeness statement, stating that within the considered logic, the fragment is expressively complete for the property. Its verification generally requires more effort. In this paper, we will actually verify a stronger expressive completeness statement. Namely, for each semantic property \(\mathfrak {P}\) and corresponding fragment \(\texttt {L} _{\mathfrak {P}}\) from Table 1, we are going to provide an effective translation operation \((\cdot )^\mathsf{p}: \texttt {M} \texttt {E} ^{\infty }\rightarrow \texttt {L} _{\mathfrak {P}}\) such that

$$\begin{aligned} \text {if }\varphi \in \texttt {M} \texttt {E} ^{\infty }\text { has the property } \mathfrak {P} \text { then } \varphi \text { is equivalent to } \varphi ^\mathsf{p}. \end{aligned}$$
(2)

The proof of each instance of (2) will follow a uniform pattern, analogous to the one employed in the aim of obtaining similar results in the context of the modal \(\mu \)-calculus [11, 15, 19]. The crux of the adopted proof method is the following. Extending known results on monadic first-order logic and using an appropriate version of Ehrenfeucht–Fraïssé games, for each sentence \(\varphi \) in \( \texttt {M} \texttt {E} ^{\infty }\) it is possible to compute a logically equivalent sentence in basic normal norm. Such normal forms will take the shape of a disjunction \(\bigvee \nabla _{\texttt {M} \texttt {E} ^{\infty }}\), where each disjunct \(\nabla _{\texttt {M} \texttt {E} ^{\infty }}\) characterises a class of models of \(\varphi \) satisfying the same set of \({\texttt {M} \texttt {E} ^{\infty }}\)-sentences of equal quantifier rank as \(\varphi \). Based on this, it will therefore be enough to define an effective translation \((\cdot )^\mathsf{p}\) for sentences in normal form, point-wise in each disjunct \(\nabla _{\texttt {M} \texttt {E} ^{\infty }}\), and then verify that it indeed satisfies (2).

As a corollary of the employed proof method, we obtain effective normal forms for sentences satisfying the considered property.

In addition to \(\texttt {M} \texttt {E} ^{\infty }\), in this paper we also consider monadic first-order logic with and without equality, denoted by \(\texttt {M} \texttt {E} \) and \(\texttt {M} \), respectively. Table 2 shows a summary of the expressive completeness and normal form results presented in this paper.

Table 2 An overview of our expressive completeness and normal form results

Since the satisfiability problem for \(\texttt {M} \texttt {E} ^{\infty }\) is decidable and the translation \((\cdot )^\mathsf{p}\) is effectively computable, we obtain, as an immediate corollary of (2), that for each property \(\mathfrak {P}\) listed in Table 1

$$\begin{aligned} \text {the problem whether a }\texttt {M} \texttt {E} ^{\infty }-\text {sentence satisfies property} \mathfrak {P} \text {or not is decidable.} \end{aligned}$$
(3)

We consider these decidability results as a byproduct of our characterisation results, and we do not explore, for instance, computational complexity questions. Addressing these would involve a study of the complexity of the procedure that brings a formula \(\varphi \) into normal form and then translates it into a formula \((\varphi )^\mathsf{p}\) of the required shape. There are easier ways to prove the mentioned decidability resultsFootnote 4 and these may be useful as well to obtain complexity results.

1.2 Application of obtained results: the companion paper

Our original motivation to study characterisation results for these logics stems from our interest in so-called parity automata: these are finite-state systems that play a crucial role in obtaining decidability and expressiveness results in fixpoint logics and monadic second-order logics over trees and labelled transition systems (see e.g. [30]). Parity automata are specified by a finite set of states A, a distinguished, initial state \(a \in A\), a function \(\Omega \) assigning to each states a priority (a natural number), and a transition map \(\varDelta \). In various interesting cases, the co-domain of this transition map is given by a monadic logic in which the set of (monadic) predicates coincides with A. Hence, each monadic logic \(\texttt {L} \) induces its own class of automata \(\texttt {Aut} (\texttt {L} )\).

A landmark result in this area is Janin and Walukiewicz’s theorem stating that the bisimulation-invariant fragment of monadic second-order logic coincides with the modal \(\mu \)-calculus [19], and the proof of this result is an interesting mix of the theory of parity automata and the model theory of monadic predicate logic. First, normal forms results and characterisation theorems are used to verify that (on tree models) \(\texttt {Aut} (\texttt {Pos} _{}(\texttt {M} \texttt {E} ))\) is the class of automata characterising the expressive power of monadic second-order logic [32], whereas \(\texttt {Aut} (\texttt {Pos} _{}(\texttt {M} ))\) corresponds to the modal \(\mu \)-calculus [18], where \(\texttt {Pos} _{}(\texttt {L} )\) denote the positive fragment of the monadic logic \(\texttt {L} \). Then, Janin and Walukiewicz’ expressiveness theorem is a consequence of these automata characterisations and the fact that positive monadic first-order logic without equality provides the quotient-invariant fragment of positive monadic first-order logic with equality (see Theorem 7).

In our companion paper [10], among other things we provide a Janin–Walukiewicz type characterisation result for weak monadic second-order logic. Analogous to the case of full monadic second-order logic discussed previously, our proof crucially employs normal form results and characterisation theorems for \(\texttt {M} \texttt {E} ^{\infty }\), as listed in the Tables 1 and 2.

1.3 Other versions

Results in this paper first appeared in the first author’s PhD thesis ( [8, Chapter 5]); this journal version largely expands material first published as part of the conference papers [9, 14]. In particular, the whole of Sect. 6 below contains new results.

2 Basics

In this section we provide the basic definitions of the monadic predicate logics that we study in this paper.

Throughout this paper we fix a finite set A of objects that we shall refer to as (monadic) predicate symbols or names. We shall also assume an infinite set \(\mathsf {iVar}\) of individual variables.

Definition 1

Given a finite set A we define a (monadic) model to be a pair \(\mathbb {D}= (D,V )\) consisting of a set D, which we call the domain of \(\mathbb {D}\), and an interpretation or valuation \(V : A \rightarrow \wp (D)\). The class of all models will be denoted by \(\mathfrak {M}\).

Remark 1

Note that we make the somewhat unusual choice of allowing the domain of a monadic model to be empty. In view of the applications of our results to automata theory (see Sect. 1) this choice is very natural, even if it means that some of our proofs here become more laborious in requiring an extra check. Observe that there is exactly one monadic model based on the empty domain; we shall denote this model as \({\mathbb {D}_\varnothing }{:=}(\varnothing , \varnothing )\).

Definition 2

Observe that a valuation \(V: A \rightarrow \wp (D)\) can equivalently be presented via its associated colouring \(V^{\flat }:D \rightarrow \wp (A)\) given by

$$\begin{aligned} V^{\flat }(d) {:=}\{a \in A \mid d \in V(a)\}. \end{aligned}$$

We will use these perspectives interchangeably, calling the set \(V^{\flat }(d) \subseteq A\) the colour or type of d. In case \(D = \varnothing \), \(V^{\flat }\) is simply the empty map.

In this paper we study three languages of monadic predicate logic: the languages \(\texttt {M} \texttt {E} \) and \(\texttt {M} \) of monadic first-order logic with and without equality, respectively, and the extension \(\texttt {M} \texttt {E} ^{\infty }\) of \(\texttt {M} \texttt {E} \) with the generalised quantifiers \(\exists ^\infty \) and \(\forall ^\infty \). Probably the most concise definition of the full language of monadic predicate logic would be given by the following grammar:

$$\begin{aligned} \varphi \mathrel {::=}a(x) \mid x \approx y \mid \lnot \varphi \mid (\varphi \vee \varphi ) \mid \exists x.\varphi \mid \exists ^\infty x.\varphi , \end{aligned}$$

where \(a \in A\) and x and y belong to the set \(\mathsf {iVar}\) of individual variables. In this set-up we would need to introduce the quantifiers \(\forall \) and \(\forall ^\infty \) as abbreviations of \(\lnot \exists \lnot \) and \(\lnot \exists ^\infty \lnot \), respectively. However, for our purposes it will be more convenient to work with a variant of this language where all formulas are in negation normal form; that is, we only permit the occurrence of the negation symbol \(\lnot \) in front of an atomic formula. In addition, for technical reasons we will add \(\bot \) and \(\top \) as constants, and we will write \(\lnot (x \approx y)\) as \(x \not \approx y\). Thus we arrive at the following definition of our syntax.

Definition 3

The set \(\texttt {M} \texttt {E} ^{\infty }(A)\) of monadic formulas is given by the following grammar:

$$\begin{aligned} \varphi&\mathrel {::=}\top \mid \bot \mid a(x) \mid \lnot a(x) \mid x \approx y \mid x \not \approx y \mid (\varphi \vee \varphi ) \mid (\varphi \wedge \varphi ) \\&\quad \quad \times \mid \exists x.\varphi \mid \forall x.\varphi \mid \exists ^\infty x.\varphi \mid \forall ^\infty x.\varphi \end{aligned}$$

where \(a \in A\) and \(x,y\in \mathsf {iVar}\). The language \(\texttt {M} \texttt {E} (A)\) of monadic first-order logic with equality is defined as the fragment of \(\texttt {M} \texttt {E} ^{\infty }(A)\) where occurrences of the generalised quantifiers \(\exists ^\infty \) and \(\forall ^\infty \) are not allowed:

$$\begin{aligned} \varphi \mathrel {::=}\top \mid \bot \mid a(x) \mid \lnot a(x) \mid x \approx y \mid x \not \approx y \mid (\varphi \vee \varphi ) \mid (\varphi \wedge \varphi ) \mid \exists x.\varphi \mid \forall x.\varphi \end{aligned}$$

Finally, the language \(\texttt {M} (A)\) of monadic first-order logic is the equality-free fragment of \(\texttt {M} \texttt {E} (A)\); that is, atomic formulas of the form \(x \approx y\) and \(x \not \approx y\) are not permitted either:

$$\begin{aligned} \varphi \mathrel {::=}\top \mid \bot \mid a(x) \mid \lnot a(x) \mid (\varphi \vee \varphi ) \mid (\varphi \wedge \varphi ) \mid \exists x.\varphi \mid \forall x.\varphi \end{aligned}$$

In all three languages we use the standard definition of free and bound variables, and we call a formula a sentence if it has no free variables. In the sequel we will often use the symbol \(\texttt {L} \) to denote either of the languages \(\texttt {M} \), \(\texttt {M} \texttt {E} \) or \(\texttt {M} \texttt {E} ^{\infty }\).

For each of the languages \(\texttt {L} \in \{ \texttt {M} , \texttt {M} \texttt {E} , \texttt {M} \texttt {E} ^{\infty }\}\), we define the positive fragment \(\texttt {L} ^{+}\) of \(\texttt {L} \) as the language obtained by almost the same grammar as for \(\texttt {L} \), but with the difference that we do not allow negative formulas of the form \(\lnot a(x)\).

To define the semantics of these languages we need to make a case distinction. For non-empty models we use the standard truth definition, which applies to arbitrary formulas since we can introduce the notion of an assignment, mapping individual variables to elements of the domain. In the case of the empty model, however, it is not possible to define assignments, so here we restrict the truth definition to sentences.

Definition 4

The meaning of sentences in the languages \(\texttt {M} , \texttt {M} \texttt {E} \) and \(\texttt {M} \texttt {E} ^{\infty }\) is given in the form of a truth relation \(\models \). To define this truth relation on a model \(\mathbb {D}= (D ,V)\), we distinguish cases.

Case \(D=\varnothing \)::

We define the truth relation \(\models \) on the empty model \({\mathbb {D}_\varnothing }\) for all formulas that are Boolean combinations of sentences of the form \(Qx. \varphi \), where \(Q \in \{ \exists , \exists ^\infty , \forall , \forall ^\infty \}\) is a quantifier. The definition is by induction on the complexity of such sentences; the “atomic” clauses, where the sentence is of the form \(Qx. \varphi \), are as follows:

$$\begin{aligned} \begin{array}{lll} {\mathbb {D}_\varnothing }\not \models Qx. \varphi &{} \text {if}\quad Q \in \{ \exists , \exists ^\infty \},&{} \\ {\mathbb {D}_\varnothing }\models Qx. \varphi &{} \text {if}\quad Q \in \{ \forall , \forall ^\infty \}.&{} \end{array} \end{aligned}$$

The clauses for the Boolean connectives are standard.

Case \(D \ne \varnothing \)::

In the case of a non-empty model \(\mathbb {D}\), we extend the truth relation to arbitrary formulas in a standard way, involving assignments of individual variables to elements of the domain. That is, given a model \(\mathbb {D}= (D,V)\), an assignment \(g :\mathsf {iVar}\rightarrow D\) and a formula \(\varphi \in \texttt {M} \texttt {E} ^{\infty }(A)\) we define the truth relation \(\models \) by a straightforward induction on the complexity of \(\varphi \). Below we explicitly provide the clauses of the quantifiers:

$$\begin{aligned}\begin{array}{llll} &{}\mathbb {D},g \models \exists x.\varphi &{} \text {iff}\quad \mathbb {D},g [x\mapsto d] \models \varphi \text { for some }d\in D,\\ &{}\mathbb {D},g \models \forall x.\varphi &{} \text {iff}\quad \mathbb {D},g [x\mapsto d] \models \varphi \text { for all }d\in D,\\ &{}\mathbb {D},g \models \exists ^\infty x.\varphi &{} \text {iff}\quad \mathbb {D},g [x\mapsto d] \models \varphi \text { for infinitely many }d\in D,\\ &{}\mathbb {D},g \models \forall ^\infty x.\varphi &{} \text {iff}\quad \mathbb {D},g [x\mapsto d] \models \varphi \text { for all but at most finitely many }d\in D. \end{array} \end{aligned}$$

The clauses for the atomic formulas and for the Boolean connectives are standard.

In what follows, when discussing the truth of \(\varphi \) on the empty model, we always implicitly assume that \(\varphi \) is a sentence.

As mentioned in the introduction, general quantifiers such as \(\exists ^\infty \) and \(\forall ^\infty \) were introduced by Mostowski [24], who proved the decidability for the language obtained by extending \(\texttt {M} \) with such quantifiers. The decidability of the full language \(\texttt {M} \texttt {E} ^{\infty }\) was then proved by Slomson in [25].Footnote 5 The case for \(\texttt {M} \) and \(\texttt {M} \texttt {E} \) goes back already to [4, 22].

Fact 1

For each logic \(\texttt {L} \in \{ \texttt {M} , \texttt {M} \texttt {E} , \texttt {M} \texttt {E} ^{\infty }\}\), the problem of whether a given \(\texttt {L} \)-sentence \(\varphi \) is satisfiable, is decidable.

In the remainder of the section we fix some further definitions and notations, starting with some useful syntactic abbreviations.

Definition 5

Given a list \(\overline{{\mathbf {y}}} = y_1\ldots y_n\) of individual variables, we use the formula

$$\begin{aligned} \text {diff}(\overline{{\mathbf {y}}}) {:=}\bigwedge _{1\le m < m^{\prime } \le n} (y_m \not \approx y_{m^{\prime }}) \end{aligned}$$

to state that the elements \(\overline{{\mathbf {y}}}\) are all distinct. An A-type is a formula of the form

$$\begin{aligned} \tau _{S}(x) {:=}\bigwedge _{a\in S}a(x) \wedge \bigwedge _{a\in A\setminus S}\lnot a(x), \end{aligned}$$

where \(S \subseteq A\). Here and elsewhere we use the convention that \(\bigwedge \varnothing \!=\! \top \) (and \(\bigvee \varnothing \!=\! \bot \)). The positive A-type \(\tau _{S}^+(x)\) only bears positive information, and is defined as

$$\begin{aligned} \tau _{S}^+(x) {:=}\bigwedge _{a\in S}a(x). \end{aligned}$$

Given a monadic model \(\mathbb {D}= (D,V)\) and a subset S of A, we define

$$\begin{aligned} |S|_\mathbb {D}{:=}|\{d\in D \mid \mathbb {D}\models \tau _S(d) \}| \end{aligned}$$

as the number of elements of \(\mathbb {D}\) that realise the type \(\tau _{S}\).

We often blur the distinction between the formula \(\tau _{S}(x)\) and the subset \(S \subseteq A\), calling S an A-type as well. Note that we have \(\mathbb {D}\models \tau _S(d)\) iff \(V^{\flat }(d) = S\), so that we may refer to \(V^{\flat }(d)\) as the type of \(d \in D\) indeed.

Definition 6

The quantifier rank \(\texttt {qr} (\varphi )\) of a formula \(\varphi \in \texttt {M} \texttt {E} ^{\infty }\) (hence also for \(\texttt {M} \) and \(\texttt {M} \texttt {E} \)) is defined as follows:

$$\begin{aligned} \begin{array}{llll} \texttt {qr} (\varphi ) &{}{:=}&{} 0 &{} \text {if }\varphi \text {is atomic},\\ \texttt {qr} (\lnot \psi ) &{} {:=}&{} \texttt {qr} (\psi ) \\ \texttt {qr} (\psi _{1}\mathrel {\heartsuit }\psi _{2}) &{} {:=}&{} \max \{\texttt {qr} (\psi _1),\texttt {qr} (\psi _2)\} &{} \text {where } \heartsuit \in \{ \wedge ,\vee \}\\ \texttt {qr} (Qx.\psi ) &{} {:=}&{} 1+\texttt {qr} (\psi ), &{} \text {where } Q \in \{\exists ,\forall ,\exists ^\infty ,\forall ^\infty \} \end{array} \end{aligned}$$

Given a monadic logic \(\texttt {L} \) we write \(\mathbb {D}\equiv _k^{\texttt {L} } \mathbb {D}'\) to indicate that the models \(\mathbb {D}\) and \(\mathbb {D}'\) satisfy exactly the same sentences \(\varphi \in \texttt {L} \) with \(\texttt {qr} (\varphi ) \le k\). We write \(\mathbb {D}\equiv ^{\texttt {L} } \mathbb {D}'\) if \(\mathbb {D}\equiv _k^{\texttt {L} } \mathbb {D}'\) for all k. When clear from context, we may omit explicit reference to \(\texttt {L} \).

Definition 7

A partial isomorphism between two models (DV) and \((D',V ')\) is a partial function \(f: D \rightharpoonup D'\) which is injective and satisfies that \(d \in V (a) \Leftrightarrow f(d) \in V '(a)\) for all \(a\in A\) and \( d\in \mathsf {Dom}(f)\). Given two sequences \(\overline{{\mathbf {d}}} \in D^k\) and \(\overline{{\mathbf {d'}}} \in {D'}^k\) we use \(f:\overline{{\mathbf {d}}} \mapsto \overline{{\mathbf {d'}}}\) to denote the partial function \(f:D\rightharpoonup D'\) defined as \(f(d_i) {:=}d'_i\). We will take care to avoid cases where there exist \(d_i,d_j\) such that \(d_i = d_j\) but \(d'_i \ne d'_j\).

Finally, for future reference we briefly discuss the notion of Boolean duals. We first give a concrete definition of a dualisation operator on the set of monadic formulas.

Definition 8

The (Boolean) dual \(\varphi ^{\delta } \in {\texttt {M} \texttt {E} ^{\infty }}(A)\) of \(\varphi \in {\texttt {M} \texttt {E} ^{\infty }}(A)\) is the formula given by:

$$\begin{aligned} (a(x))^{\delta }&{:=}a(x)&(\lnot a(x))^{\delta }&{:=}\lnot a(x)\\ (\top )^{\delta }&{:=}\bot&(\bot )^{\delta }&{:=}\top \\ (x \approx y)^{\delta }&{:=}x \not \approx y&(x \not \approx y)^{\delta }&{:=}x \approx y \\ (\varphi \wedge \psi )^{\delta }&{:=}\varphi ^{\delta } \vee \psi ^{\delta }&(\varphi \vee \psi )^{\delta }&{:=}\varphi ^{\delta } \wedge \psi ^{\delta }\\ (\exists x.\psi )^{\delta }&{:=}\forall x.\psi ^{\delta }&(\forall x.\psi )^{\delta }&{:=}\exists x.\psi ^{\delta }\\ (\exists ^{\infty } x.\psi )^{\delta }&{:=}\forall ^{\infty } x.\psi ^{\delta }&(\forall ^{\infty } x.\psi )^{\delta }&{:=}\exists ^{\infty } x.\psi ^{\delta } \end{aligned}$$

Remark 2

Where \(\texttt {L} \in \{\texttt {M} ,\texttt {M} \texttt {E} ,\texttt {M} \texttt {E} ^{\infty }\}\), observe that if \(\varphi \in \texttt {L} (A)\) then \(\varphi ^{\delta } \in \texttt {L} (A)\). Moreover, the operator preserves positivity of the predicates, that is, if \(\varphi \in \texttt {L} ^+(A)\) then \(\varphi ^{\delta } \in \texttt {L} ^+(A)\).

The following proposition states that the formulas \(\varphi \) and \(\varphi ^{\delta }\) are Boolean duals. We omit its proof, which is a routine check.

Proposition 1

Let \(\varphi \in \texttt {M} \texttt {E} ^{\infty }(A)\) be a monadic formula. Then \(\varphi \) and \(\varphi ^{\delta }\) are indeed Boolean duals, in the sense that for every monadic model (DV) we have that

$$\begin{aligned} (D,V) \models \varphi \text { iff } (D,V^{c}) \not \models \varphi ^{\delta }, \end{aligned}$$

where \(V^{c}: A \rightarrow \wp (D)\) is the valuation given by \(V^{c}(a) {:=}D \setminus V(a)\).

3 Normal forms

In this section we provide, for each of the logics \(\texttt {M} \), \(\texttt {M} \texttt {E} \) and \(\texttt {M} \texttt {E} ^{\infty }\), normal forms that will be pivotal for characterising the different fragments of these logics in later sections. Our approach will be game-theoretic, based on Ehrenfeucht–Fraïssé style model comparison games. These games were introduced by Ehrenfeucht [13] to study Fraïssé’s analyis of first-order logic using so-called back-and-forth systems. Over the years, similar games have been introduced for various other logics, including extensions of first-order logic with generalised quantifiers [20]. As an important application of Ehrenfeucht–Fraïssé games one may use the notion of a winning strategy to obtain certain normal forms for formulas in the formalism under scrutiny. In the case of monadic first-order logic, one may extract relatively simple normal forms; this observation goes back to (at least) the work of Walukiewicz [32]. Our contribution here is that we use the method to obtain normal forms for the logic \(\texttt {M} \texttt {E} ^{\infty }\).

Convention

Here and in the sequel it will often be convenient to blur the distinction between lists and sets. For instance, identifying the list \(\overline{{\mathbf {T}}} = T_{1}\ldots T_{n}\) with the set \(\{ T_{1}, \ldots , T_{n} \}\), we may write statements like \(S \in \overline{{\mathbf {T}}}\) or \(\Pi \subseteq \overline{{\mathbf {T}}}\). Moreover, given a finite set \(\varPhi = \{\varphi _1, \ldots , \varphi _n\}\), we write \(\varphi _1 \wedge \cdots \wedge \varphi _n\) as \(\bigwedge \varPhi \), and \(\varphi _1 \vee \cdots \vee \varphi _n\) as \(\bigvee \varPhi \). If \(\varPhi \) is empty, we set as usual \(\bigwedge \varPhi = \top \) and \(\bigvee \varPhi = \bot \). Finally, notice that we write \(\bigvee _{1\le m < m^{\prime } \le n} (y_m \approx y_{m^{\prime }}) \vee \psi \) as \(\text {diff}(\overline{{\mathbf {y}}}) \rightarrow \psi \).

3.1 Normal form for \(\texttt {M} \)

We start by introducing a normal form for monadic first-order logic without equality.

Definition 9

Given sets of types \(\Sigma , \Pi \subseteq \wp (A)\), we define the following formulas:

$$\begin{aligned} \begin{array}{lll} \nabla _{\texttt {M} }(\Sigma ,\Pi ) &{}{:=}&{} \bigwedge _{S\in \Sigma } \exists x. \tau _S(x) \wedge \forall x. \bigvee _{S\in \Pi } \tau _S(x)\\ \nabla _{\texttt {M} }(\Sigma ) &{}{:=}&{} \nabla _{\texttt {M} }(\Sigma ,\Sigma ) \end{array} \end{aligned}$$

A sentence of \(\texttt {M} (A)\) is in basic form if it is a disjunction of formulas of the form \(\nabla _{\texttt {M} }(\Sigma )\).

Observe that \(\nabla _{\texttt {M} }(\Sigma ,\Pi ) \equiv \bot \) in case \(\Sigma \not \subseteq \Pi \) and that \(\nabla _{\texttt {M} }(\Sigma ,\Pi ) = \nabla _{\texttt {M} }(\Sigma ) = \forall x. \bot \) if \(\Sigma =\Pi = \varnothing \). The meaning of the formula \(\nabla _{\texttt {M} }(\Sigma )\) is that \(\Sigma \) is a complete description of the collection of types that are realised in a monadic model. The formula \(\nabla _{\texttt {M} }(\varnothing )\) distinghuishes the empty model from the non-empty ones.

Every \(\texttt {M} \)-formula is effectively equivalent to a formula in basic form.

Fact 2

There is an effective procedure that transforms an arbitrary \(\texttt {M} \)-sentence \(\varphi \) into an equivalent formula \(\varphi ^{*}\) in basic form.

This observation is easy to prove using Ehrenfeucht–Fraïssé games (proof sketches can be found in [16, Lemma 16.23] and [31, Proposition 4.14]), and the decidability of the satisfiability problem for \(\texttt {M} \) (Fact 1). We omit a full proof because it is very similar to the following more complex cases.

3.2 Normal form for \(\texttt {M} \texttt {E} \)

Due to the additional expressive power provided by the (in-)equalities, the basic normal forms of \(\texttt {M} \texttt {E} \) take a more involved shape than those of \(\texttt {M} \).

Definition 10

We say that a formula \(\varphi \in \texttt {M} \texttt {E} (A)\) is in basic form if \(\varphi = \bigvee \nabla _{\texttt {M} \texttt {E} }(\overline{{\mathbf {T}}},\Pi )\) where each disjunct is of the form

$$\begin{aligned} \nabla _{\texttt {M} \texttt {E} }(\overline{{\mathbf {T}}},\Pi ) = \exists \overline{{\mathbf {x}}}.\big (\text {diff}(\overline{{\mathbf {x}}}) \wedge \bigwedge _i \tau _{T_i}(x_i) \wedge \forall z.(\text {diff}(\overline{{\mathbf {x}}},z) \rightarrow \bigvee _{S\in \Pi } \tau _S(z))\big ) \end{aligned}$$

with \(\overline{{\mathbf {T}}} \in \wp (A)^k\) for some k and \(\Pi \subseteq \overline{{\mathbf {T}}}\).

We prove that every sentence of monadic first-order logic with equality is equivalent to a formula in basic form. Although this result seems to be folklore, we provide a detailed proof because some of its ingredients will be used later, when we give a normal form for \(\texttt {M} \texttt {E} ^{\infty }\). We start by defining the following relation between monadic models.

Definition 11

For every \(k \in \mathbb {N}\) we define the relation \(\sim ^=_k\) on the class \(\mathfrak {M}\) of monadic models by putting

$$\begin{aligned} \mathbb {D}\sim ^=_k \mathbb {D}' \Longleftrightarrow \forall S\subseteq A \ \big ( |S|_\mathbb {D}= |S|_{\mathbb {D}'} < k \text { or } |S|_\mathbb {D},|S|_{\mathbb {D}'} \ge k \big ), \end{aligned}$$

where \(\mathbb {D}\) and \(\mathbb {D}'\) are arbitrary monadic models.

Intuitively, two models are related by \(\sim ^=_k\) when their type information coincides ‘modulo k’. Later on we prove that this is the same as saying that they cannot be distinguished by a sentence of \(\texttt {M} \texttt {E} \) with quantifier rank at most k. As a special case, observe that any two monadic models are related by \(\sim ^{=}_{0}\).

For the moment, we record the following properties of these relations.

Proposition 2

The following hold:

  1. 1.

    The relation \(\sim ^=_k\) is an equivalence relation of finite index.

  2. 2.

    Every \(E \in \mathfrak {M}/{\sim ^=_k}\) is characterised by a sentence \(\varphi ^=_E \in \texttt {M} \texttt {E} (A)\) with \(\texttt {qr} (\varphi ^=_E) = k\).

Proof

We only prove the second statement, and first we consider the case where \(k=0\). The equivalence relation \(\sim ^{=}_{0}\) has the class \(\mathfrak {M}\) of all monadic models as its unique equivalence class, so here we may define \(\varphi ^{=}_{\mathfrak {M}} {:=}\top \).

From now on we assume that \(k>0\). Take some equivalence class \(E \in \mathfrak {M}/{\sim ^=_k}\), and some representative \(\mathbb {D}\in E\). Let \(S_1,\ldots ,S_n \subseteq A\) be the types such that \(|S_i|_\mathbb {D}= l_i<k\) and let \(S'_1,\ldots ,S'_m \subseteq A\) be those satisfying \(|S'_i|_\mathbb {D}\ge k\). Note that the union of all the \(S_i\) and \(S'_i\) yields all the possible A-types, and that if a type \(S_{j}\) is not realised at all, we take \(l_j = 0\). Now define

$$\begin{aligned} \varphi ^=_E \quad {:=}\quad&\bigwedge _{i\le n} \Big (\exists x_1,\ldots ,x_{l_i}. \text {diff}(x_1,\ldots ,x_{l_i}) \ \wedge \ \bigwedge _{j\le l_i} \tau _{S_i}(x_j)\\&\qquad \qquad \qquad \wedge \forall z. \text {diff}(x_1,\ldots ,x_{l_i},z) \rightarrow \lnot \tau _{S_i}(z)\Big )\ \\&\wedge \bigwedge _{i\le m} \big (\exists x_1,\ldots ,x_k.\text {diff}(x_1,\ldots ,x_k) \wedge \bigwedge _{j\le k} \tau _{S'_i}(x_j) \big ), \end{aligned}$$

where we understand that any conjunct of the form \(\exists x_1, \ldots ,x_{l}.\psi \) with \(l = 0\) is simply omitted (or, to the same effect, defined as \(\top \)). It is easy to see that \(\texttt {qr} (\varphi ^=_E) = k\) and that \(\mathbb {D}' \models \varphi ^=_E\) iff \(\mathbb {D}' \in E\). Intuitively, \(\varphi ^=_E\) gives a specification of E “type by type”; in particular observe that \(\varphi ^=_{\mathbb {D}_\varnothing }\equiv \forall x. \bot \). \(\square \)

Next we recall a (standard) notion of Ehrenfeucht–Fraïssé game for \(\texttt {M} \texttt {E} \) which will be used to establish the connection between \({\sim ^=_k}\) and \(\equiv _k^{\texttt {M} \texttt {E} }\).

Definition 12

Let \(\mathbb {D}_0 = (D_0,V_0)\) and \(\mathbb {D}_1 = (D_1,V_1)\) be monadic models. We define the game \(\text {EF}^=_k(\mathbb {D}_0,\mathbb {D}_1)\) between \(\forall \) and \(\exists \). If \(\mathbb {D}_i\) is one of the models we use \(\mathbb {D}_{-i}\) to denote the other model. A position in this game is a pair of sequences \(\overline{{\mathbf {s_0}}} \in D_0^n\) and \(\overline{{\mathbf {s_1}}} \in D_1^n\) with \(n \le k\). The game consists of k rounds. To describe a single round of the game, assume that n rounds have passed (where \(0 \le n < k\)); round \(n+1\) then consists of the following steps:

  1. 1.

    \(\forall \) chooses an element \(d_i\) in one of the \(\mathbb {D}_i\);

  2. 2.

    \(\exists \) responds with an element \(d_{-i}\) in the model \(\mathbb {D}_{-i}\).

In this way, the sequences \(\overline{{\mathbf {s_i}}} \in D_i^n\) of elements chosen up to round n are extended to \({\overline{{\mathbf {s_i}}}' {:=}\overline{{\mathbf {s_i}}}\cdot d_i} \in D_i^{n+1}\). Player \(\exists \) survives the round iff she does not get stuck and the function \(f_{n+1}: \overline{{\mathbf {s_0}}}' \mapsto \overline{{\mathbf {s_1}}}'\) is a partial isomorphism of monadic models. Finally, player \(\exists \) wins the match iff she survives all k rounds.

Given \(n\le k\) and \(\overline{{\mathbf {s_i}}} \in D_i^n\) such that \(f_n:\overline{{\mathbf {s_0}}}\mapsto \overline{{\mathbf {s_1}}}\) is a partial isomorphism, we write \(\text {EF}_{k}^=(\mathbb {D}_0, \mathbb {D}_1)@(\overline{{\mathbf {s_0}}},\overline{{\mathbf {s_1}}})\) to denote the (initialised) game where n moves have been played and \(k-n\) moves are left to be played.

Proposition 3

The following are equivalent:

  1. (1)

    \(\mathbb {D}_0 \equiv _k^{\texttt {M} \texttt {E} } \mathbb {D}_1\),

  2. (2)

    \(\mathbb {D}_0 \sim _k^= \mathbb {D}_1\),

  3. (3)

    \(\exists \) has a winning strategy in \(\text {EF}_k^=(\mathbb {D}_0,\mathbb {D}_1)\).

Proof

The implication from (1) to (2) is direct by Proposition 2. For the implication from (2) to (3) we give a winning strategy for \(\exists \) in \(\text {EF}_k^=(\mathbb {D}_0,\mathbb {D}_1)\) by showing the following claim.

Claim 1

Let \(\mathbb {D}_0 \sim _k^= \mathbb {D}_1\) and \(\overline{{\mathbf {s_i}}} \in D_i^n\) be such that \(n<k\) and \(f_n:\overline{{\mathbf {s_0}}}\mapsto \overline{{\mathbf {s_1}}}\) is a partial isomorphism; then \(\exists \) can survive one more round in \(\text {EF}_{k}^=(\mathbb {D}_0,\mathbb {D}_1)@(\overline{{\mathbf {s_0}}},\overline{{\mathbf {s_1}}})\).

Proof of Claim 1

Let \(\forall \) pick \(d_i\in D_i\) such that the type of \(d_i\) is \(T \subseteq A\). If \(d_i\) had already been played then \(\exists \) picks the same element as before and \(f_{n+1} = f_n\). If \(d_i\) is new and \(|T|_{\mathbb {D}_i} \ge k\) then, as at most \(n<k\) elements have been played, there is always some new \(d_{-i} \in D_{-i}\) that \(\exists \) can choose to match \(d_i\). If \(|T|_{\mathbb {D}_i} = m < k\) then we know that \(|T|_{\mathbb {D}_{-i}} = m\). Therefore, as \(d_i\) is new and \(f_n\) is injective, there must be a \(d_{-i} \in D_{-i}\) that \(\exists \) can choose. \(\square \)

The implication from (3) to (1) is a standard result [12, Corollary 2.2.9] which we prove anyway because we will need to extend it later. We prove the following loaded statement.

Claim 2

Let \(\overline{{\mathbf {s_i}}} \in D_i^n\) and \(\varphi (z_1,\ldots ,z_n) \in \texttt {M} \texttt {E} (A)\) be such that \(\texttt {qr} (\varphi ) \le k-n\). If \(\exists \) has a winning strategy in the game \(\text {EF}_k^=(\mathbb {D}_0,\mathbb {D}_1)@(\overline{{\mathbf {s_0}}},\overline{{\mathbf {s_1}}})\) then \(\mathbb {D}_0 \models \varphi (\overline{{\mathbf {s_0}}})\) iff \(\mathbb {D}_1 \models \varphi (\overline{{\mathbf {s_1}}})\).

Proof of Claim 2

If \(\varphi \) is atomic the claim holds because of \(f_n:\overline{{\mathbf {s_0}}}\mapsto \overline{{\mathbf {s_1}}}\) being a partial isomorphism. The Boolean cases are straightforward. Let \(\varphi (z_1,\ldots ,z_n) = \exists x. \psi (z_1,\ldots ,z_n,x)\) and suppose \(\mathbb {D}_0 \models \varphi (\overline{{\mathbf {s_0}}})\). Hence, there exists \(d_0 \in D_0\) such that \(\mathbb {D}_0 \models \psi (\overline{{\mathbf {s_0}}},d_0)\). By hypothesis we know that \(\exists \) has a winning strategy for \(\text {EF}_k^=(\mathbb {D}_0,\mathbb {D}_1)@(\overline{{\mathbf {s_0}}},\overline{{\mathbf {s_1}}})\). Therefore, if \(\forall \) picks \(d_0\in D_0\) she can respond with some \(d_1\in D_1\) and have a winning strategy for \(\text {EF}_{k}^=(\mathbb {D}_0,\mathbb {D}_1) @(\overline{{\mathbf {s_0}}}{\cdot }d_0,\overline{{\mathbf {s_1}}}{\cdot }d_1)\). By induction hypothesis, because \(\texttt {qr} (\psi ) \le k- (n+1)\), we have that \(\mathbb {D}_0 \models \psi (\overline{{\mathbf {s_0}}},d_0)\) iff \(\mathbb {D}_1 \models \psi (\overline{{\mathbf {s_1}}},d_1)\) and hence \(\mathbb {D}_1 \models \exists x.\psi (\overline{{\mathbf {s_1}}},x)\). The opposite direction is proved by a symmetric argument. \(\square \)

We finish the proof of the proposition by combining these two claims. \(\square \)

Theorem 1

There is an effective procedure that transforms an arbitrary \(\texttt {M} \texttt {E} \)-sentence \(\varphi \) into an equivalent formula \(\varphi ^{*}\) in basic form.

Proof

Let \(\texttt {qr} (\psi ) = k\) and let \(\llbracket \psi \rrbracket \) be the class of models satisfying \(\psi \). As \(\mathfrak {M}/{\equiv _k^{\texttt {M} \texttt {E} }}\) is the same as \(\mathfrak {M}/{\sim _k^=}\) by Proposition 3, it is easy to see that \(\psi \) is equivalent to \(\bigvee \{ \varphi ^=_E \mid E \in \llbracket \psi \rrbracket /{\sim _k^=} \}\). Now it only remains to see that each \(\varphi ^=_E\) is equivalent to the sentence \(\nabla _{\texttt {M} \texttt {E} }(\overline{{\mathbf {T}}},\Pi )\) for some \(\overline{{\mathbf {T}}},\Pi \subseteq \wp (A)\) with \(\Pi \subseteq \overline{{\mathbf {T}}}\).

The crucial observation is that we will use \(\overline{{\mathbf {T}}}\) and \(\Pi \) to give a specification of the types “element by element”. Take some representative \(\mathbb {D}\) of the equivalence class E. Let \(S_1,\ldots ,S_n \subseteq A\) be the types such that \(|S_i|_\mathbb {D}= l_i < k\) and \(S'_1,\ldots ,S'_m \subseteq A\) those satisfying \(|S'_j|_\mathbb {D}\ge k\). The size of the sequence \(\overline{{\mathbf {T}}}\) is defined to be \((\sum _{i=1}^n l_i) + k\times m\) where \(\overline{{\mathbf {T}}}\) contains exactly \(l_i\) occurrences of type \(S_i\) and at least k occurrences of each \(S'_j\). On the other hand we set \(\Pi {:=}\{S'_1,\ldots ,S'_m\}\). It is straightforward to check that \(\Pi \subseteq \overline{{\mathbf {T}}}\) and \(\varphi ^=_E\) is equivalent to \(\nabla _{\texttt {M} \texttt {E} }(\overline{{\mathbf {T}}},\Pi )\). (Observe however, that the quantifier rank of the latter is only bounded by \(k\times 2^{|A|} + 1\).) In particular \(\varphi ^=_{\mathbb {D}_\varnothing }\equiv \nabla _{\texttt {M} \texttt {E} }(\varnothing ,\varnothing ) = \forall x. \bot \).

The effectiveness of the procedure follows from the fact that, given the previous bound on the size of a normal form, it is possible to non-deterministically guess the number of disjuncts, types and associated parameters for each conjunct and repeatedly check whether the formulas \(\varphi \) and \(\bigvee \nabla _{\texttt {M} \texttt {E} }(\overline{{\mathbf {T}}},\Pi )\) are equivalent, this latter problem being decidable by Fact 1. \(\square \)

3.3 Normal form for \(\texttt {M} \texttt {E} ^{\infty }\)

The logic \(\texttt {M} \texttt {E} ^{\infty }\) extends \(\texttt {M} \texttt {E} \) with the capacity to tear apart finite and infinite sets of elements. This is reflected in the normal form for \(\texttt {M} \texttt {E} ^{\infty }\) by adding extra information to the normal form of \(\texttt {M} \texttt {E} \).

Definition 13

We say that a formula \(\varphi \in \texttt {M} \texttt {E} ^{\infty }(A)\) is in basic form if \(\varphi = \bigvee \nabla _{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma )\) where each disjunct is of the form

$$\begin{aligned} \nabla _{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma ) {:=}\nabla _{\texttt {M} \texttt {E} }(\overline{{\mathbf {T}}},\Pi ) \wedge \nabla _{\!\!\infty }(\Sigma ) \end{aligned}$$

where \(\nabla _{\texttt {M} \texttt {E} }(\overline{{\mathbf {T}}},\Pi )\) is as in Definition 10, and

$$\begin{aligned} \nabla _{\!\!\infty }(\Sigma ) {:=}\bigwedge _{S\in \Sigma } \exists ^\infty y.\tau _S(y) \wedge \forall ^\infty y.\bigvee _{S\in \Sigma } \tau _S(y). \end{aligned}$$

Here \(\overline{{\mathbf {T}}} \in \wp (A)^{k}\) for some k, and \(\Pi ,\Sigma \subseteq \wp (A)\) are such that \(\Sigma \subseteq \Pi \subseteq \overline{{\mathbf {T}}}\).

Observe that basic formulas of \(\texttt {M} \texttt {E} \) are not basic formulas of \(\texttt {M} \texttt {E} ^{\infty }\).

Intuitively, the formula \(\nabla _{\!\!\infty }(\Sigma )\) says that (1) for every type \(S\in \Sigma \), there are infinitely many elements satisfying S and (2) only finitely many elements do not satisfy any type in \(\Sigma \). As a special case, the formula \(\nabla _{\!\!\infty }(\varnothing )\) expresses that the model is finite. A short argument reveals that, intuitively, every disjunct of the form \(\nabla _{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma )\) expresses that any monadic model satisfying it admits a partition of its domain in three parts:

  1. (i)

    distinct elements \(t_1,\ldots ,t_n\) with respective types \(T_1,\ldots ,T_n\),

  2. (ii)

    finitely many elements whose types belong to \(\Pi \), and

  3. (iii)

    for each \(S\in \Sigma \), infinitely many elements with type S.

Note that this partition is not necessarily unique, unless we modify item (ii) so that it mentions finitely many elements whose type belongs to \(\Pi \setminus \Sigma \).

In the same way as before, we define an equivalence relation \(\sim ^\infty _k\) on monadic models which refines \(\sim ^=_{k}\) by adding information about the (in-)finiteness of the types.

Definition 14

For every \(k \in \mathbb {N}\) we define the relation \(\sim ^{\infty }_k\) on the class \(\mathfrak {M}\) of monadic models by putting

$$\begin{aligned} \mathbb {D}\sim ^\infty _{k} \mathbb {D}' \Longleftrightarrow \forall S\subseteq A \ \big ( |S|_\mathbb {D}= |S|_{\mathbb {D}'}< k \text { or } k \le |S|_\mathbb {D},|S|_{\mathbb {D}'} < \omega \text { or } |S|_\mathbb {D},|S|_{\mathbb {D}'} \ge \omega \big ), \end{aligned}$$

where \(\mathbb {D}\) and \(\mathbb {D}'\) are arbitrary monadic models.

As before, with this definition we find that \(\mathbb {D}\sim ^\infty _0 \mathbb {D}'\) holds always.

Proposition 4

The following hold, for every \(k \in \mathbb {N}\):

  1. 1.

    The relation \(\sim ^\infty _k\) is an equivalence relation of finite index.

  2. 2.

    The relation \(\sim ^\infty _k\) is a refinement of \(\sim ^=_k\).

  3. 3.

    Every \(E \in \mathfrak {M}/{\sim ^\infty _k}\) is characterised by a sentence \(\varphi ^\infty _E \in \texttt {M} \texttt {E} ^{\infty }(A)\) with \(\texttt {qr} (\varphi ) = k\).

Proof

We only prove the last item, for \(k>0\). Let \(E \in \mathfrak {M}/{\sim ^\infty _k}\) and let \(\mathbb {D}\in E\) be a representative of the class. Let \(E' \in \mathfrak {M}/{\sim ^=_k}\) be the equivalence class of \(\mathbb {D}\) with respect to \(\sim ^=_k\). Let \(S_1,\ldots ,S_n \subseteq A\) be all the types such that \(|S_i|_\mathbb {D}\ge \omega \), and define

$$\begin{aligned} \varphi ^\infty _E {:=}\varphi ^=_{E'} \wedge \nabla _{\!\!\infty }(\{S_1,\ldots ,S_n\}). \end{aligned}$$

It is not difficult to see that \(\texttt {qr} (\varphi ^\infty _E) = k\) and that \(\mathbb {D}' \models \varphi ^\infty _E\) iff \(\mathbb {D}' \in E\). In particular \(\varphi ^\infty _{\mathbb {D}_\varnothing }\equiv \nabla _{\texttt {M} \texttt {E} ^{\infty }}(\varnothing ,\varnothing ,\varnothing ) = \forall x. \bot \wedge \forall ^\infty y. \bot \). \(\square \)

Now we give a version of the Ehrenfeucht–Fraŝsé game for \(\texttt {M} \texttt {E} ^{\infty }\). This game, which extends \(\text {EF}^=_k\) with moves for \(\exists ^\infty \), is the adaptation of the Ehrenfeucht–Fraïssé game for monotone generalised quantifiers found in [20] to the case of full monadic first-order logic.

Definition 15

Let \(\mathbb {D}_0 = (D_0,V_0)\) and \(\mathbb {D}_1 = (D_1,V_1)\) be monadic models. We define the game \(\text {EF}^\infty _k(\mathbb {D}_0,\mathbb {D}_1)\) between \(\forall \) and \(\exists \). A position in this game is a pair of sequences \(\overline{{\mathbf {s_0}}} \in D_0^n\) and \(\overline{{\mathbf {s_1}}} \in D_1^n\) with \(n \le k\). The game consists of k rounds. To describe a single round of the game, assume that n rounds have passed (where \(0 \le n < k\)); round \(n+1\) then consists of the following steps.

First \(\forall \) chooses to perform one of the following types of moves:

  1. (a)

    second-order move:

    1. 1.

      \(\forall \) chooses an infinite set \(X_i \subseteq D_i\);

    2. 2.

      \(\exists \) responds with an infinite set \(X_{-i} \subseteq D_{-i}\);

    3. 3.

      \(\forall \) chooses an element \(d_{-i} \in X_{-i}\);

    4. 4.

      \(\exists \) responds with an element \(d_i \in X_i\).

  2. (b)

    first-order move:

    1. 1.

      \(\forall \) chooses an element \(d_i \in D_i\);

    2. 2.

      \(\exists \) responds with an element \(d_{-i} \in D_{-i}\).

The sequences \(\overline{{\mathbf {s_i}}} \in D_i^n\) of elements chosen up to round n are then extended to \({\overline{{\mathbf {s_i}}}' {:=}\overline{{\mathbf {s_i}}}\cdot d_i} \in D_{i}^{n+1}\). \(\exists \) survives the round iff she does not get stuck and the function \(f_{n+1}: \overline{{\mathbf {s_0}}}' \mapsto \overline{{\mathbf {s_1}}}'\) is a partial isomorphism of monadic models.

Note that the only items that are recorded in a play of this game are the objects picked by the players; the subsets that are picked in a round starting with a second-order move by \(\forall \), are forgotten as soon as the players have selected inhabitants of these sets (Fig. 1).

Proposition 5

The following are equivalent:

  1. (1)

    \(\mathbb {D}_0 \equiv _k^{\texttt {M} \texttt {E} ^{\infty }} \mathbb {D}_1\),

  2. (2)

    \(\mathbb {D}_0 \sim _k^\infty \mathbb {D}_1\),

  3. (3)

    \(\exists \) has a winning strategy in \(\text {EF}_k^\infty (\mathbb {D}_0,\mathbb {D}_1)\).

Proof

The implication from (1) to (2) is direct by Proposition 4. For the implication from (2)) to (3) we show the following.

Claim 1

Let \(\mathbb {D}_0 \sim _k^\infty \mathbb {D}_1\) and \(\overline{{\mathbf {s_i}}} \in D_i^n\) be such that \(n<k\) and \(f_n:\overline{{\mathbf {s_0}}}\mapsto \overline{{\mathbf {s_1}}}\) is a partial isomorphism. Then \(\exists \) can survive one more round in \(\text {EF}_{k}^\infty (\mathbb {D}_0, \mathbb {D}_1)@(\overline{{\mathbf {s_0}}},\overline{{\mathbf {s_1}}})\).

Fig. 1
figure 1

Elements of type S have coloured background

Proof of Claim 1

We focus on the second-order moves because the first-order moves are the same as in the corresponding Claim of Proposition 3. Let \(\forall \) choose an infinite set \(X_i \subseteq D_i\), we would like \(\exists \) to choose an infinite set \(X_{-i} \subseteq D_{-i}\) such that the following conditions hold:

  1. (a)

    The map \(f_n\) is a well-defined partial isomorphism between the restricted monadic models \(\mathbb {D}_0{\upharpoonright }X_0\) and \(\mathbb {D}_1{\upharpoonright }X_1\),

  2. (b)

    For every type S there is an element \(d\in X_i\) of type S which is not connected by \(f_n\) iff there is such an element in \(X_{-i}\).

First we prove that such a set \(X_{-i}\) exists. To satisfy item (a) \(\exists \) just needs to add to \(X_{-i}\) the elements connected to \(X_i\) by \(f_n\); this is not a problem.

For item (b) we proceed as follows: for every type S such that there is an element \(d\in X_i\) of type S, we add a new element \(d'\in D_{-i}\) of type S to \(X_{-i}\). To see that this is always possible, observe first that \(\mathbb {D}_0 \sim _k^\infty \mathbb {D}_1\) implies \(\mathbb {D}_0 \sim _k^= \mathbb {D}_1\). Using the properties of this relation, we divide in two cases:

  • If \(|S|_{D_i} \ge k\) we know that \(|S|_{D_{-i}} \ge k\) as well. From the elements of \(D_{-i}\) of type S, at most \(n<k\) are used by \(f_n\). Hence, there is at least one \(d'\in D_{-i}\) of type S to choose from.

  • If \(|S|_{D_i} < k\) we know that \(|S|_{D_{i}} = |S|_{D_{-i}}\). From the elements of \(D_{i}\) of type S, at most \(|S|_{D_{i}}-1\) are used by \(f_n\). (The reason for the ‘\(-1\)’ is that we are assuming that we have just chosen a \(d\in X_i\) which is not in \(f_n\).) Using that \(|S|_{D_{i}} = |S|_{D_{-i}}\) and that \(f_n\) is a partial isomorphism we can again conclude that there is at least one \(d'\in D_{-i}\) of type S to choose from.

Finally, we need to show that \(\exists \) can choose \(X_{-i}\) to be infinite. To see this, observe that \(X_{i}\) is infinite, while there are only finitely many types. Hence there must be some S such that \(|S|_{X_i} \ge \omega \). It is then safe to add infinitely many elements for S in \(X_{-i}\) while considering point (b). Moreover, the existence of infinitely many elements satisfying S in \(D_{-i}\) is guaranteed by \(\mathbb {D}_0 \sim _k^\infty \mathbb {D}_1\).

Having shown that \(\exists \) can choose a set \(X_{-i}\) satisfying the above conditions, it is now clear that using point (b) \(\exists \) can survive the “first-order part” of the second-order move we were considering. This finishes the proof of the claim. \(\square \)

Returning to the proof of Proposition 5, for the implication from (3) to (1) we prove the following.

Claim 2

Let \(\overline{{\mathbf {s_i}}} \in D_i^n\) and \(\varphi (z_1,\ldots ,z_n) \in \texttt {M} \texttt {E} ^{\infty }(A)\) be such that \(\texttt {qr} (\varphi ) \le k-n\). If \(\exists \) has a winning strategy in \(\text {EF}_k^\infty (\mathbb {D}_0,\mathbb {D}_1)@(\overline{{\mathbf {s_0}}},\overline{{\mathbf {s_1}}})\) then \(\mathbb {D}_0 \models \varphi (\overline{{\mathbf {s_0}}})\) iff \(\mathbb {D}_1 \models \varphi (\overline{{\mathbf {s_1}}})\).

Proof of Claim 2

All the cases involving operators of \(\texttt {M} \texttt {E} \) are the same as in Proposition 3. We prove the inductive case for the generalised quantifier. Let \(\varphi (z_1,\ldots ,z_n)\) be of the form \(\exists ^\infty x.\psi (z_1,\ldots ,z_n,x)\) and let \(\mathbb {D}_0 \models \varphi (\overline{{\mathbf {s_0}}})\). Hence, the set \(X_{0} {:=}\{ d_{0} \in D_{0} \mid \mathbb {D}_0 \models \psi (\overline{{\mathbf {s_0}}},d_0) \}\) is infinite.

By assumption \(\exists \) has a winning strategy in \(\text {EF}_k^\infty (\mathbb {D}_0,\mathbb {D}_1)@(\overline{{\mathbf {s_0}}},\overline{{\mathbf {s_1}}})\). Therefore, if \(\forall \) plays a second-order move by picking \(X_0 \subseteq D_0\) she can respond with some infinite set \(X_1 \subseteq D_1\). We claim that \(\mathbb {D}_1 \models \psi (\overline{{\mathbf {s_1}}},d_1)\) for every \(d_1\in X_1\). First observe that if this holds then the set \(X'_1 {:=}\{ d_1 \in D_1 \mid \mathbb {D}_1 \models \psi (\overline{{\mathbf {s_1}}},d_1)\}\) must be infinite, and hence \(\mathbb {D}_1 \models \exists ^\infty x.\psi (\overline{{\mathbf {s_1}}},x)\).

Assume, for a contradiction, that \(\mathbb {D}_1 \not \models \psi (\overline{{\mathbf {s_1}}},d'_1)\) for some \(d'_1\in X_1\). Let \(\forall \) play this \(d'_1\) as the second part of his move. Then, as \(\exists \) has a winning strategy, she will respond with some \(d'_0 \in X_0\) for which she has a winning strategy in \(\text {EF}_{k}^\infty (\mathbb {D}_0,\mathbb {D}_1) @(\overline{{\mathbf {s_0}}}{\cdot }d'_0,\overline{{\mathbf {s_1}}}{\cdot }d'_1)\). But then by our induction hypothesis, which applies since \(\texttt {qr} (\psi ) \le k-(n+1)\), we may infer from \(\mathbb {D}_1 \not \models \psi (\overline{{\mathbf {s_1}}},d'_1)\) that \(\mathbb {D}_0 \not \models \psi (\overline{{\mathbf {s_0}}},d'_0)\). This clearly contradicts the fact that \(d'_{0} \in X_{0}\). \(\square \)

Combining the claims finishes the proof of the proposition. \(\square \)

Theorem 2

There is an effective procedure that transforms an arbitrary \(\texttt {M} \texttt {E} ^{\infty }\)-sentence \(\varphi \) into an equivalent formula \(\varphi ^{*}\) in basic form.

Proof

This can be proved using the same argument as in Theorem 1 but based on Proposition 5. Hence we only focus on showing that \(\varphi _E^\infty \equiv \nabla _{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma )\) for some \(\overline{{\mathbf {T}}},\Pi ,\Sigma \subseteq \wp (A)\) such that \(\Sigma \subseteq \Pi \subseteq \overline{{\mathbf {T}}}\), where \(\varphi _E^\infty \) is the sentence characterising \(E \in \mathfrak {M}/{\sim ^\infty _k}\) from Proposition 4(2). Recall that

$$\begin{aligned} \varphi ^\infty _E = \varphi ^=_{E'} \wedge \nabla _{\!\!\infty }(\Sigma ) \end{aligned}$$

where \(\Sigma \) is the collection of types that are realised by infinitely many elements. Using Theorem 1 on \(\varphi ^=_{E'}\) we know that this is equivalent to

$$\begin{aligned} \varphi ^\infty _E = \nabla _{\texttt {M} \texttt {E} }(\overline{{\mathbf {T}}},\Pi ) \wedge \nabla _{\!\!\infty }(\Sigma ) \end{aligned}$$

where \(\Pi \subseteq \overline{{\mathbf {T}}}\). Observe that we may assume that \(\Sigma \subseteq \Pi \), otherwise the formula would be inconsistent. We can then conclude that \(\varphi ^\infty _E \equiv \nabla _{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma )\). \(\square \)

4 Monotonicity

In this section we provide our first characterisation result, which concerns the notion of monotonicity.

Definition 16

Let V and \(V'\) be two A-valuations on the same domain D, and \(B \subseteq A\). Then we say that \(V'\) is a B-extension of V, notation: \(V \le _{B} V'\), if \(V(b) \subseteq V'(b)\) for every \(b \in B\), and \(V(a) = V'(a)\) for every \(a \in A \setminus B\).

Given a monadic logic \(\texttt {L} \) and a formula \(\varphi \in \texttt {L} (A)\) we say that \(\varphi \) is monotone in \(B \subseteq A\) if

$$\begin{aligned} (D,V),g \models \varphi \text { and } V \le _{B} V' \text { imply } (D,V'),g \models \varphi , \end{aligned}$$
(4)

for every pair of monadic models (DV) and \((D,V')\) and every assignment \(g:\mathsf {iVar}\rightarrow D\).

Remark 3

It is easy to prove that a formula is monotone in \(B \subseteq A\) if and only if it is monotone in every \(b \in B\).

The semantic property of monotonicity can usually be linked to the syntactic notion of positivity. Indeed, for many logics, a formula \(\varphi \) is monotone in \(a \in A\) iff \(\varphi \) is equivalent to a formula where all occurrences of a have a positive polarity, that is, they are situated in the scope of an even number of negations.

Definition 17

For \(\texttt {L} \in \{ \texttt {M} , \texttt {M} \texttt {E} \}\) we define the fragment of A-formulas that are positive in all predicates in B, in short: the B-positive formulas by the following grammar:

$$\begin{aligned} \varphi \mathrel {::=}\psi \mid b(x) \mid (\varphi \wedge \varphi ) \mid (\varphi \vee \varphi ) \mid \exists x.\varphi \mid \forall x.\varphi , \end{aligned}$$

where \(b \in B\) and \(\psi \in \texttt {L} (A\setminus B)\) (that is, there are no occurrences of any \(b \in B\) in \(\psi \)). Similarly, the B-positive fragment of \(\texttt {M} \texttt {E} ^{\infty }\) is given by

$$\begin{aligned} \varphi \mathrel {::=}\psi \mid b(x) \mid (\varphi \wedge \varphi ) \mid (\varphi \vee \varphi ) \mid \exists x.\varphi \mid \forall x.\varphi \mid \exists ^\infty x.\varphi \mid \forall ^\infty x.\varphi , \end{aligned}$$

where \(b\in B\) and \(\psi \in \texttt {M} \texttt {E} ^{\infty }(A\setminus B)\).

In all three cases, we let \(\texttt {Pos} _{B}(\texttt {L} (A))\) denote the set of B-positive sentences.

Note that the difference between the fragments \(\texttt {Pos} _{B}(\texttt {M} (A))\) and \(\texttt {Pos} _{B}(\texttt {M} \texttt {E} (A))\) lies in the fact that in the latter case, the ‘B-free’ formulas \(\psi \) may contain the equality symbol, both positively (\(\approx \)) and negatively (\(\not \approx \)). Clearly \(\texttt {Pos} _{A}(\texttt {L} (A))= \texttt {L} ^+\).

Remark 4

Perhaps a more natural presentation of the fragment \(\texttt {Pos} _{B}(\texttt {L} (A))\) would be via the following grammar (in the case of \(\texttt {M} \), the other cases would be similar):

$$\begin{aligned} \varphi \mathrel {::=}\top \mid \bot \mid a(x) \mid \lnot a(x) \mid b(x) \mid (\varphi \wedge \varphi ) \mid (\varphi \vee \varphi ) \mid \exists x.\varphi \mid \forall x.\varphi , \end{aligned}$$
(5)

where \(a \in A \setminus B\) and \(b \in B\). It is not difficult to see that the above grammar produces the same formulas as the one in Definition 17. The latter presentation, however, is more convenient in the context of our companion paper [10], and in line with the definition of the fragments \(\texttt {Con} _{B}(\texttt {M} \texttt {E} ^{\infty }(A))\) studied in the next section.

Theorem 3

Let \(\varphi \) be a sentence of the monadic logic \(\texttt {L} (A)\), where \(\texttt {L} \in \{ \texttt {M} , \texttt {M} \texttt {E} , \texttt {M} \texttt {E} ^{\infty }\}\). Then \(\varphi \) is monotone in a set \(B \subseteq A\) if and only if there is an equivalent formula \(\varphi ^{\oslash } \in \texttt {Pos} _{B}(\texttt {L} (A))\). Furthermore, it is decidable whether a sentence \(\varphi \in \texttt {L} (A)\) has this property or not.

The ‘easy’ direction of the first claim of the theorem is taken care of by the following proposition.

Proposition 6

Every formula \(\varphi \in \texttt {Pos} _{B}(\texttt {L} (A))\) is monotone in B, where \(\texttt {L} \) is one of the logics \(\{ \texttt {M} , \texttt {M} \texttt {E} , \texttt {M} \texttt {E} ^{\infty }\}\).

Proof

The case for \(D= \varnothing \) being immediate, we assume \(D \ne \varnothing \). The proof is a routine argument by induction on the complexity of \(\varphi \). That is, we show by induction, that any formula \(\varphi \) in the B-positive fragment (which may not be a sentence) satisfies (4), for every monadic model (DV), valuation \(V' \ge _{B} V\) and assignment \({g:\mathsf {iVar}\rightarrow D}\). We focus on the generalised quantifiers. Let \((D,V),g \models \varphi \) and \(V \le _{B} V'\).

  • Case \(\varphi = \exists ^\infty x.\varphi '(x)\). By definition there exists an infinite set \(I\subseteq D\) such that for all \(d\in I\) we have \((D,V),g[x\mapsto d] \models \varphi '(x)\). By induction hypothesis \((D,V'),g[x\mapsto d] \models \varphi '(x)\) for all \(d \in I\). Therefore \((D,V'),g \models \exists ^\infty x.\varphi '(x)\).

  • Case \(\varphi = \forall ^\infty x.\varphi '(x)\). Hence there exists \(C\subseteq D\) such that for all \(d\in C\) we have \((D,V),g[x\mapsto d] \models \varphi '(x)\) and \(D\setminus C\) is finite. By induction hypothesis \((D,V'),g[x\mapsto d] \models \varphi '(x)\) for all \(d \in C\). Therefore \((D,V'),g \models \forall ^\infty x.\varphi '(x)\).

This finishes the proof. \(\square \)

The ‘hard’ direction of the first claim of Theorem 3 states that the fragment \(\texttt {Pos} _{B}(\texttt {M} )\) is complete for monotonicity in B. In order to prove this, we need to show that every sentence which is monotone in B is equivalent to some formula in \(\texttt {Pos} _{B}(\texttt {M} )\). We actually are going to prove a stronger result.

Proposition 7

Let \(\texttt {L} \) be one of the logics \(\{ \texttt {M} , \texttt {M} \texttt {E} , \texttt {M} \texttt {E} ^{\infty }\}\). There exists an effective translation \((-)^{\oslash }:\texttt {L} (A) \rightarrow \texttt {Pos} _{B}(\texttt {L} (A))\) such that a sentence \({\varphi \in \texttt {L} (A)}\) is monotone in \(B \subseteq A\) only if \(\varphi \equiv \varphi ^\oslash \).

We prove the three manifestations of Proposition 7 separately, in three respective subsections.

Proof of Theorem 3

The first claim of the Theorem is an immediate consequence of Proposition 7. By effectiveness of the translation and Fact 1, it is decidable whether a sentence \(\varphi \in \texttt {L} (A)\) is monotone in \(B \subseteq A\) or not. \(\square \)

The following definition will be used throughout in the remainder of the section.

Definition 18

Given \(S \subseteq A\) and \(B \subseteq A\) we use the following notation

$$\begin{aligned} \tau ^{B}_S(x) {:=}\bigwedge _{b\in S} b(x) \wedge \bigwedge _{b\in A\setminus (S\cup B)}\lnot b(x), \end{aligned}$$

for what we call the B-positive A-type \(\tau ^{B}_S\).

Intuitively, \(\tau ^{B}_S\) works almost like the A-type \(\tau _S\), the difference being that \(\tau ^{B}_S\) discards the negative information for the names in B. If \(B = \{a\}\) we write \(\tau ^a_S\) instead of \(\tau ^{\{a\}}_S\). Observe that with this notation, \(\tau ^+_S\) is equivalent to \(\tau ^A_S\).

4.1 Monotone fragment of \(\texttt {M} \)

In this subsection we prove the \(\texttt {M} \)-variant of Proposition 7. That is, we give a translation that constructively maps arbitrary sentences into \(\texttt {Pos} _{B}(\texttt {M} )\) and moreover preserves truth iff the given sentence is monotone in B. To formulate the translation we need to introduce some new notation.

Definition 19

Let \(B\subseteq A\) be a finite set of names and \(\Sigma \subseteq \wp (A)\) be types. The B-positive variant of \(\nabla _{\texttt {M} }(\Sigma )\) is given as follows:

$$\begin{aligned} \nabla ^{B}_\texttt {M} (\Sigma ) {:=}\bigwedge _{S\in \Sigma } \exists x. \tau ^{B}_S(x) \wedge \forall x. \bigvee _{S\in \Sigma } \tau ^{B}_S(x). \end{aligned}$$

We also introduce the following generalised forms of the above notation, with types \(\Pi \subseteq \wp (A)\):

$$\begin{aligned} \nabla ^{B}_\texttt {M} (\Sigma ,\Pi ) {:=}\bigwedge _{S\in \Sigma } \exists x. \tau ^{B}_S(x) \wedge \forall x. \bigvee _{S\in \Pi } \tau ^{B}_S(x). \end{aligned}$$

The positive variants of the above notations are defined as \(\nabla ^+_{\texttt {M} }(\Sigma ) {:=}\nabla ^{A}_\texttt {M} (\Sigma )\) and \(\nabla ^+_{\texttt {M} }(\Sigma ,\Pi ) {:=}\nabla ^{A}_\texttt {M} (\Sigma ,\Pi )\).

Proposition 8

There exists an effective translation \((-)^\oslash :\texttt {M} (A) \rightarrow \texttt {Pos} _{B}(\texttt {M} (A))\) such that a sentence \({\varphi \in \texttt {M} (A)}\) is monotone in \(B\subseteq A\) if and only if \(\varphi \equiv \varphi ^\oslash \).

Proof

To define the translation, by Fact 2, we may assume without loss of generality that \(\varphi \) is in the normal form \(\bigvee \nabla _{\texttt {M} }(\Sigma )\). We define the translation as

$$\begin{aligned} \left( \bigvee \nabla _{\texttt {M} }(\Sigma )\right) ^\oslash {:=}\bigvee \nabla ^{B}_\texttt {M} (\Sigma ). \end{aligned}$$

From the construction it is clear that \(\varphi ^\oslash \in \texttt {Pos} _{B}(\texttt {M} (A))\). Then, the right-to-left direction of the proposition is immediate by Proposition 6. For the left-to-right direction, assume that \(\varphi \) is monotone in B. It suffices to prove that \((D,V) \models \varphi \) if and only if \((D,V) \models \varphi ^\oslash \).

This direction is trivial.

Assume \((D,V) \models \varphi ^\oslash \) and let \(\Sigma \) be such that \((D,V) \models \nabla ^{B}_\texttt {M} (\Sigma )\). If \(D = \varnothing \), then \(\Sigma = \varnothing \) and \(\nabla ^{B}_\texttt {M} (\Sigma )= \nabla _{\texttt {M} }(\Sigma )\). Hence, assume \(D \ne \varnothing \), and clearly \(\Sigma \ne \varnothing \).

We claim the existence of a surjective map \(T: D \rightarrow \Sigma \) such that \((D,V) \models \tau ^{B}_{T_{d}}(d)\), for every d in D. To see this, first note that, because of the existential part of \(\nabla ^{B}_\texttt {M} (\Sigma )\), every type \(S \in \Sigma \) has a ‘B-witness’ in \(\mathbb {D}\), that is, an element \(d_{S} \in D\) such that \((D,V) \models \tau ^{B}_{S}(d_{S})\). It is in fact safe to assume that all these witnesses are distinct (this is because (DV) can be proved to be \(\texttt {M} \)-equivalent to such a model, cf. Proposition 18). This means that we may define \(T: d_{S} \mapsto S\), which will ensure the surjectivity of T. It remains to extend the definition of T to the elements of D that are not of the form \(d_{S}\) for some \(S \in \Sigma \). But this is easy: because of the universal part of \(\nabla ^{B}_\texttt {M} (\Sigma )\), we may find for every element d in D some type \(S_{d}\) in \(\Sigma \) such that \((D,V) \models \tau ^{B}_{S_{d}}(d)\). Putting these observations together, it should be clear that the map \(T: D \rightarrow \wp (A)\) given by \(T(d) {:=}S\) if \(d = d_{S}\) for some \(S \in \Sigma \), and \(T(d) {:=}S_{d}\) otherwise, satisfies the requirements.

Now let \(U: A \rightarrow \wp (D)\) be the valuation of which T is the associated colouring, cf. Definition 2. That is, we put \(U(a) {:=}\{ d \in D \mid a \in T_{d} \}\). The definition of U is tailored towards the claim that

$$\begin{aligned} (D,U) \models \nabla _{\texttt {M} }(\Sigma ). \end{aligned}$$
(6)

To see why this is the case, first take an arbitrary \(d \in D\); it is immediate by the definitions that \((D,U) \models \tau _{T_{d}}(d)\), and since \(T_{d} \in \Sigma \), this takes care of the universal conjunct of the formula \(\nabla _{\texttt {M} }(\Sigma )\). Now take an arbitrary \(S \in \Sigma \). It follows by the surjectivity of T that there is a \(d \in D\) such that \(\Sigma = T_{d}\); and since we saw that \((D,U) \models \tau _{T_{d}}(d)\), this takes care of the existential part.

Clearly it follows from (6) that \((D,U) \models \varphi \). But then by monotonicity of \(\varphi \), we are done if we can show that

$$\begin{aligned} U \le _{B} V. \end{aligned}$$
(7)

To see this, observe that for \(a \in A \setminus B\) we have the following equivalences:

$$\begin{aligned} d \in U(a) \iff a \in T_{d} \iff (D,V) \models a(d) \iff d \in V(a), \end{aligned}$$

while for \(b \in B\) we can prove

$$\begin{aligned} d \in U(b) \iff b \in T_{d} \Longrightarrow (D,V) \models b(d) \iff d \in V(b). \end{aligned}$$

This suffices to prove (7), and finishes the proof of the Proposition. \(\square \)

A careful analysis of the translation gives us the following corollary, providing normal forms for the monotone fragment of \(\texttt {M} \).

Corollary 1

For any sentence \(\varphi \in \texttt {M} (A)\), the following hold.

  1. 1.

    The formula \(\varphi \) is monotone in \(B \subseteq A\) iff it is equivalent to a formula in the basic form \(\bigvee \nabla ^{B}_\texttt {M} (\Sigma )\) for some types \(\Sigma \subseteq \wp (A)\).

  2. 2.

    The formula \(\varphi \) is monotone in every \(a\in A\) iff \(\varphi \) is equivalent to a formula \(\bigvee \nabla ^+_{\texttt {M} }(\Sigma )\) for some types \(\Sigma \subseteq \wp (A)\).

In both cases the normal forms are effective.

4.2 Monotone fragment of \(\texttt {M} \texttt {E} \)

In order to prove the \(\texttt {M} \texttt {E} \)-variant of Proposition 7, we need to introduce some new notation.

Definition 20

Let \(B\subseteq A\) be a finite set of names, \(\Pi \subseteq \wp (A)\) be some types, and \(\overline{{\mathbf {T}}} \in \wp (A)^k\) some list of types. The B-monotone variant of \(\nabla _{\texttt {M} \texttt {E} }(\overline{{\mathbf {T}}},\Pi )\) is given as follows:

$$\begin{aligned} \nabla ^{B}_{\texttt {M} \texttt {E} }(\overline{{\mathbf {T}}},\Pi )&{:=}\exists \overline{{\mathbf {x}}}.\big (\text {diff}(\overline{{\mathbf {x}}}) \wedge \bigwedge _i \tau ^{B}_{T_i}(x_i) \wedge \forall z.(\text {diff}(\overline{{\mathbf {x}}},z) \rightarrow \bigvee _{S\in \Pi } \tau ^{B}_S(z)) \big ). \end{aligned}$$

When the set B is a singleton \(\{a\}\) we will write a instead of B. The positive variant \(\nabla ^+_{\texttt {M} \texttt {E} }(\overline{{\mathbf {T}}},\Pi )\) of \(\nabla _{\texttt {M} \texttt {E} }(\overline{{\mathbf {T}}},\Pi )\) is defined as above but with \(+\) in place of B.

Proposition 9

There exists an effective translation \((-)^\oslash :\texttt {M} \texttt {E} (A) \rightarrow \texttt {Pos} _{B}(\texttt {M} \texttt {E} (A))\) such that a sentence \({\varphi \in \texttt {M} \texttt {E} (A)}\) is monotone in B if and only if \(\varphi \equiv \varphi ^\oslash \).

Proof

In proposition 10 this result is proved for \(\texttt {M} \texttt {E} ^{\infty }\) (i.e., \(\texttt {M} \texttt {E} \) extended with generalised quantifiers). It is not difficult to adapt the proof for \(\texttt {M} \texttt {E} \). The translation is defined as follows. By Theorem 1 we may assume without loss of generality that \(\varphi \) is in basic normal form \(\bigvee \nabla _{\texttt {M} \texttt {E} }(\overline{{\mathbf {T}}},\Pi )\). Then \(\varphi ^\oslash {:=}\bigvee \nabla ^{B}_{\texttt {M} \texttt {E} }(\overline{{\mathbf {T}}},\Pi )\). \(\square \)

Combining the normal form for \(\texttt {M} \texttt {E} \) and the proof of the above proposition, we obtain a normal form for the monotone fragment of \(\texttt {M} \texttt {E} \).

Corollary 2

For any sentence \(\varphi \in \texttt {M} \texttt {E} (A)\), the following hold.

  1. 1.

    The formula \(\varphi \) is monotone in \(B\subseteq A\) iff it is equivalent to a formula in the basic form \(\bigvee \nabla ^{B}_{\texttt {M} \texttt {E} }(\overline{{\mathbf {T}}},\Pi )\) where for each disjunct we have \(\overline{{\mathbf {T}}} \in \wp (A)^k\) for some k and \(\Pi \subseteq \overline{{\mathbf {T}}}\).

  2. 2.

    The formula \(\varphi \) is monotone in all \(a\in A\) iff it is equivalent to a formula in the basic form \(\bigvee \nabla ^+_{\texttt {M} \texttt {E} }(\overline{{\mathbf {T}}},\Pi )\) where for each disjunct we have \(\overline{{\mathbf {T}}} \in \wp (A)^k\) for some k and \(\Pi \subseteq \overline{{\mathbf {T}}}\).

In both cases, normal forms are effective.

4.3 Monotone fragment of \(\texttt {M} \texttt {E} ^{\infty }\)

First, in this case too we introduce some notation for the positive variant of a sentence in normal form.

Definition 21

Let \(B\subseteq A\) be a finite set of names, \(\Sigma , \Pi \subseteq \wp (A)\) be some types, and \(\overline{{\mathbf {T}}} \in \wp (A)^k\) some list of types. The B-positive variant of \(\nabla _{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma )\) is given as follows:

$$\begin{aligned} \nabla ^{B}_{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma )&{:=}\nabla ^{B}_{\texttt {M} \texttt {E} }(\overline{{\mathbf {T}}},\Pi ) \wedge \nabla ^{B}_\infty (\Sigma )\\ \nabla ^{B}_\infty (\Sigma )&{:=}\bigwedge _{S\in \Sigma } \exists ^\infty y.\tau ^{B}_S(y) \wedge \forall ^\infty y.\bigvee _{S\in \Sigma } \tau ^{B}_S(y). \end{aligned}$$

When the set B is a singleton \(\{a\}\) we will write a instead of B. The positive variant of \(\nabla _{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma )\) is defined as \(\nabla ^+_{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma ) {:=}\nabla ^{A}_{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma )\).

We are now ready to proceed with the proof of the \(\texttt {M} \texttt {E} ^{\infty }\)-variant of Proposition 7 and to give the translation.

Proposition 10

There is an effective translation \((-)^\oslash :\texttt {M} \texttt {E} ^{\infty }(A) \rightarrow \texttt {Pos} _{B}(\texttt {M} \texttt {E} ^{\infty }(A))\) such that a sentence \({\varphi \in \texttt {M} \texttt {E} ^{\infty }(A)}\) is monotone in B if and only if \(\varphi \equiv \varphi ^\oslash \).

Proof

By Theorem 2, we assume that \(\varphi \) is in the normal form \(\bigvee \nabla _{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma ) = \nabla _{\texttt {M} \texttt {E} }(\overline{{\mathbf {T}}},\Pi \cup \Sigma ) \wedge \nabla _{\!\!\infty }(\Sigma )\) for some sets and list of types \(\Pi ,\Sigma , \overline{{\mathbf {T}}} \subseteq \wp (A)\) with \(\Sigma \subseteq \Pi \subseteq \overline{{\mathbf {T}}}\). For the translation we define

$$\begin{aligned} \Big (\bigvee \nabla _{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma )\Big )^\oslash {:=}\bigvee \nabla ^{B}_{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma ). \end{aligned}$$

From the construction it is clear that \(\varphi ^\oslash \in \texttt {Pos} _{B}(\texttt {M} \texttt {E} ^{\infty }(A))\). Then, the right-to-left direction of the proposition is immediate by Proposition 6. For the left-to-right direction, assume that \(\varphi \) is monotone in B, we have to prove that \((D,V) \models \varphi \) if and only if \((D,V) \models \varphi ^\oslash \).

This direction is trivial.

Assume \((D,V) \models \varphi ^\oslash \), and in particular that \((D,V) \models \nabla ^{B}_{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma )\). If \(D = \varnothing \), then \(\Sigma = \Pi = \overline{{\mathbf {T}}} = \varnothing \) and \(\nabla ^{B}_{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma )= \nabla _{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma )\). Hence, assume \(D \ne \varnothing \). Observe that the elements of D can be partitioned in the following way:

  1. (a)

    distinct elements \(t_i \in D\) such that each \(t_i\) satisfies \(\tau ^{B}_{T_i}(x)\),

  2. (b)

    for every \(S \in \Sigma \) an infinite set \(D_S\), such that every \(d \in D_S\) satisfies \(\tau ^{B}_{S}\),

  3. (c)

    a finite set \(D_\Pi \) of elements, each satisfying one of the B-positive types \(\tau ^{B}_{S}\) with \(S \in \Pi \setminus \Sigma \).

Following this partition, with every element \(d\in D\) we may associate a type \(S_{d}\) in, respectively, (a) \(\overline{{\mathbf {T}}}\), (b) \(\Sigma \), or (c) \(\Pi \setminus \Sigma \), such that d satisfies \(\tau ^{B}_{S_{d}}\). As in the proof of Proposition 8, we now consider the valuation U defined as \(U^{\flat }(d) {:=}S_d\), and as before we can show that \(U \le _{B} V\). Finally, it easily follows from the definitions that \((D,U) \models \nabla _{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma )\), implying that \((D,U) \models \varphi \). But then by the assumed B-monotonicity of \(\varphi \) it is immediate that \((D,V) \models \varphi \), as required. \(\square \)

As with the previous two cases, the translation provides normal forms for the monotone fragment of \(\texttt {M} \texttt {E} ^{\infty }\).

Corollary 3

For any sentence \(\varphi \in \texttt {M} \texttt {E} ^{\infty }(A)\), the following hold:

  1. 1.

    The formula \(\varphi \) is monotone in \(B \subseteq A\) iff it is equivalent to a formula \(\bigvee \nabla ^{B}_{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma )\) for \(\Sigma \subseteq \Pi \subseteq \wp (A)\) and \(\overline{{\mathbf {T}}} \in \wp (A)^k\) for some k.

  2. 2.

    The formula \(\varphi \) is monotone in every \(a\in A\) iff it is equivalent to a formula in the basic form \(\bigvee \nabla ^+_{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma )\) for types \(\Sigma \subseteq \Pi \subseteq \wp (A)\) and \(\overline{{\mathbf {T}}} \in \wp (A)^k\) for some k.

In both cases, normal forms are effective.

5 Continuity

In this section we study the sentences that are continuous in some set B of monadic predicate symbols.

Definition 22

Let U and V be two A-valuations on the same domain D. For a set \(B \subseteq A\), we write \(U \le ^{\omega }_{B} V\) if \(U \le _{B} V\) and U(b) is finite, for every \(b \in B\).

Given a monadic logic \(\texttt {L} \) and a formula \(\varphi \in \texttt {L} (A)\) we say that \(\varphi \) is continuous in \(B \subseteq A\) if \(\varphi \) is monotone in B and satisfies the following:

$$\begin{aligned} \text {if } (D, V), g \models \varphi \text { then } (D, U), g \models \varphi \text { for some } U \le ^{\omega }_{B} V. \end{aligned}$$
(8)

for every monadic model (DV) and every assignment \(g:\mathsf {iVar}\rightarrow D\).

Remark 5

As for monotonicity it is straightforward to show that a formula \(\varphi \) is continuous in a set B iff it is continuous in every \(b \in B\).

What explains both the name and the importance of this property is its equivalence to so-called Scott continuity. To understand it, we may formalise the dependence of the meaning of a monadic sentence \(\varphi \) with m free variables \(\overline{{\mathbf {x}}}\) in a monadic model \(\mathbb {D}= (D,V )\) on a fixed name \(b \in A\) as a map \(\varphi ^\mathbb {D}_b : \wp (D) \rightarrow \wp (D^m)\) defined by

$$\begin{aligned} X \subseteq D \mapsto \{ \overline{{\mathbf {d}}} \in D^m \mid (D,V[b \mapsto X]) \models \varphi (\overline{{\mathbf {d}}}) \}. \end{aligned}$$

One can then verify that a sentence \(\varphi \) is continuous in b if and only if the operation \(\varphi ^\mathbb {D}_b\) is continuous with respect to the Scott topology on the powerset algebras.Footnote 6 Scott continuity is of key importance in many areas of theoretical computer sciences where ordered structures play a role, such as domain theory (see e.g. [1]).

Similarly as for monotonicity, the semantic property of continuity can be provided with a syntactic characterisation.

Definition 23

Let \(\texttt {L} \in \{ \texttt {M} , \texttt {M} \texttt {E} \}\) The fragment of \(\texttt {L} (A)\)-formulas that are syntactically continuous in a subset \(B \subseteq A\) is defined by the following grammar:

$$\begin{aligned} \varphi \mathrel {::=}\psi \mid b(x) \mid (\varphi \wedge \varphi ) \mid (\varphi \vee \varphi ) \mid \exists x.\varphi , \end{aligned}$$

where \(b\in B\) and \(\psi \in \texttt {L} (A\setminus B)\). In both cases, we let \(\texttt {Con} _{B}(\texttt {L} (A))\) denote the set of B-continuous sentences.

To define the syntactically continuous fragment of \(\texttt {M} \texttt {E} ^{\infty }\), we first introduce the following binary generalised quantifier \(\mathbf {W} \): given two formulas \(\varphi (x)\) and \(\psi (x)\), we set

$$\begin{aligned} \mathbf {W} x.(\varphi ,\psi ) {:=}\forall x.(\varphi (x) \vee \psi (x)) \wedge \forall ^\infty x.\psi (x). \end{aligned}$$

The intuition behind \(\mathbf {W} \) is the following. If \((D,V),g \models \mathbf {W} x.(\varphi , \psi )\), then because of the second conjunct there are only finitely many \(d \in D\) refuting \(\psi \). The point is that this weakens the universal quantification of the first conjunct to the effect that only the finitely many mentioned elements refuting \(\psi \) need to satisfy \(\varphi \).

Definition 24

The fragment of \(\texttt {M} \texttt {E} ^{\infty }(A)\)-formulas that are syntactically continuous in a subset \(B \subseteq A\) is given by the following grammar:

$$\begin{aligned} \varphi \mathrel {::=}\psi \mid b(x) \mid (\varphi \wedge \varphi ) \mid (\varphi \vee \varphi ) \mid \exists x.\varphi \mid \mathbf {W} x.(\varphi ,\psi ), \end{aligned}$$

where \(b\in B\) and \(\psi \in \texttt {M} \texttt {E} ^{\infty }(A\setminus B)\). We let \(\texttt {Con} _{B}(\texttt {M} \texttt {E} ^{\infty }(A))\) denote the set of B-continuous \(\texttt {M} \texttt {E} ^{\infty }\)-sentences.

For \(\texttt {M} \) and \(\texttt {M} \texttt {E} \), the equivalence between the semantic and syntactic properties of continuity was established by van Benthem in [5]. To keep this paper self-contained, we give a sketch of this proof, which is based on a compactness argument.

Proposition 11

Let \(\varphi \) be a sentence of the monadic logic \(\texttt {L} (A)\), where \(\texttt {L} \in \{ \texttt {M} , \texttt {M} \texttt {E} \}\). Then \(\varphi \) is continuous in a set \(B \subseteq A\) if and only if there is an equivalent sentence \(\varphi ^{\ominus } \in \texttt {Con} _{B}(\texttt {L} (A))\).

Proof

The direction from right to left is covered by Proposition 12 below, so we immediately turn to the completeness part of the statement. The case of \(\texttt {M} \) being treated in Sect. 5.1, we only discuss the statement for \(\texttt {M} \texttt {E} \). Hence, let \(\varphi \in \texttt {M} \texttt {E} (A)\) be continuous in B. For simplicity in the exposition, we assume \(B=\{b\}\); the case of an arbitrary B can easily be generalised from what follows.

Let \(y_0, y_{1}, \ldots \) be an infinite list of variables not occurring in \(\varphi \). For \(k \in \omega \), consider the formula

$$\begin{aligned} \varphi _k {:=}\exists y_{0} \cdots \exists y_{k-1}\, \left( \bigwedge _{\ell <k}b(y_{\ell }) \wedge \varphi ([\overline{{\mathbf {y}}}/b])\right) , \end{aligned}$$

where \(\varphi ([\overline{{\mathbf {y}}}/b])\big )\) is obtained from \(\varphi \) by substituting each occurrence of an atomic formula of the form b(x) with the formula \(\bigvee _{\ell < k} x \approx y_\ell \). Intuitively, \(\varphi _{k}\) expresses that \(\varphi \) holds if we reduce the current interpretation of b to some subset of size k. Define \(\varPhi {:=}\{ \varphi _k \mid k \in \omega \} \cup \{ \varphi _{{\mathbb {D}_\varnothing }}\}\), where \( \varphi _{{\mathbb {D}_\varnothing }} {:=}\forall x. \bot \) if \({\mathbb {D}_\varnothing }\models \varphi \) and \( \varphi _{{\mathbb {D}_\varnothing }} {:=}\exists x. \bot \) otherwise. Then by construction \(\varPhi \subset \texttt {Con} _{B}(\texttt {M} \texttt {E} (A))\). Now by continuity of \(\varphi \) we find that

$$\begin{aligned} \varphi \models \bigvee \varPhi , \end{aligned}$$

that is, any non-empty monadic model that validates \(\varphi \) must validate one of the \(\varphi _{k}\). But then by compactness of first-order logic, there is an \(n \in \omega \) such that \(\varphi \models \bigvee _{k<n} \varphi _k \vee \varphi _{{\mathbb {D}_\varnothing }}\). By monotonicity, \(\varphi _{k} \models \varphi \), for every \(k \in \omega \), and by definition \(\varphi _{{\mathbb {D}_\varnothing }} \models \varphi \). We therefore conclude that \(\varphi \equiv \bigvee _{k<n} \varphi _k \vee \varphi _{{\mathbb {D}_\varnothing }}\). As \(\texttt {Con} _{B}(\texttt {M} \texttt {E} (A))\) is closed under disjunctions, this ends the proof of the statement. \(\square \)

In this paper, we extend such a characterisation to \(\texttt {M} \texttt {E} ^{\infty }\). Moreover, analogously to what we did in the previous section, for \(\texttt {M} \) and \(\texttt {M} \texttt {E} ^{\infty }\) we provide both an explicit translation and a decidability result. The corresponding results in the case of \(\texttt {M} \texttt {E} \) remain open.

Theorem 4

Let \(\varphi \) be a sentence of the monadic logic \(\texttt {L} (A)\), where \(\texttt {L} \in \{ \texttt {M} , \texttt {M} \texttt {E} ^{\infty }\}\). Then \(\varphi \) is continuous in a set \(B \subseteq A\) if and only if there is an equivalent sentence \(\varphi ^{\ominus } \in \texttt {Con} _{B}(\texttt {L} (A))\). Furthermore, it is decidable whether a sentence \(\varphi \in \texttt {L} (A)\) has this property or not.

Analogously to the previous case of monotonicity, the proof of the theorem is composed of two parts. We start with the right-left implication of the first claim (the preservation statement), which also holds for \(\texttt {M} \texttt {E} \).

Proposition 12

Every sentence \(\varphi \in \texttt {Con} _{B}(\texttt {L} (A))\) is continuous in B, where \(\texttt {L} \in \{ \texttt {M} , \texttt {M} \texttt {E} , \texttt {M} \texttt {E} ^{\infty }\}\).

Proof

First observe that \(\varphi \) is monotone in B by Proposition 6. The case for \(D=\varnothing \) being clear, we assume \(D\ne \varnothing \). We show, by induction, that any first-order formula \(\varphi \) in the fragment satisfies (8), for every non-empty monadic model (DV) and assignment \({ g:\mathsf {iVar}\rightarrow D}\).

  • If \(\varphi = \psi \in \texttt {L} (A\setminus B)\), changes in the B part of the valuation will not affect the truth value of \(\varphi \) and hence the condition is trivial.

  • Case \(\varphi = b(x)\) for some \(b \in B\): if \((D, V), g \models b(x)\) then \(g(x)\in V(b)\). Let U be the valuation given by \(U(b) {:=}\{ g(x) \}\), \(U(a) {:=}\varnothing \) for \(a \in B \setminus \{b \}\) and \(U(a) {:=}V(a)\) for \(a \in A \setminus B\). Then it is obvious that \((D, U), g \models b(x)\), while it is immediate by the definitions that \(U \le ^{\omega }_{B} V\).

  • Case \(\varphi = \varphi _1 \vee \varphi _2\): assume \((D, V), g \models \varphi \). Without loss of generality we can assume that \((D, V), g \models \varphi _1\) and hence by induction hypothesis there is \(U \le ^{\omega }_{B} V\) such that \((D, U), g \models \varphi _1\) which clearly implies \((D, U), g \models \varphi \).

  • Case \(\varphi = \varphi _1 \wedge \varphi _2\): assume \((D, V), g \models \varphi \). By induction hypothesis we have \(U_1,U_2 \le ^{\omega }_{B} V\) such that \((D,U_{1}), g \models \varphi _1\) and \((D, U_2), g \models \varphi _2\). Let U be the valuation defined by putting \(U(a) {:=}U_{1}(a) \cup U_{2}(a)\); then clearly we have \(U \le ^{\omega }_{B} V\), while it follows by monotonicity that \((D,U), g \models \varphi _1\) and \((D, U), g \models \varphi _2\). Clearly then \((D, U), g \models \varphi \).

  • Case \(\varphi = \exists x.\varphi '(x)\) and \((D, V), g \models \varphi \). By definition there exists \(d\in D\) such that \((D, V), g[x\mapsto d] \models \varphi '(x)\). By induction hypothesis there is a valuation \(U \le ^{\omega }_{B} V\) such that \((D, U), g[x\mapsto d] \models \varphi '(x)\) and hence \((D, U), g \models \exists x.\varphi '(x)\).

  • Case \(\varphi = \mathbf {W} x.(\varphi ',\psi )\in \texttt {Con} _{B}(\texttt {M} \texttt {E} ^{\infty }(A))\) and \((D, V), g \models \varphi \). Define the formulas \(\alpha (x)\) and \(\beta \) as follows:

    $$\begin{aligned} \varphi = \forall x.\underbrace{(\varphi '(x) \vee \psi (x))}_{\alpha (x)} \wedge \underbrace{\forall ^\infty x.\psi (x)}_\beta . \end{aligned}$$

    Suppose that \((D, V), g \models \varphi \). By the induction hypothesis, for every \(d \in D\) which satisfies \((D, V), g_d \models \alpha (x)\) (where we write \( g_d {:=}g[x\mapsto d]\)) there is a valuation \(U_d \le ^{\omega }_{B} V\) such that \((D, U_d), g_d \models \alpha (x)\). The crucial observation is that because of \(\beta \), only finitely many elements of d refute \(\psi (x)\). Let U be the valuation defined by putting \(U(a) {:=}\bigcup \{U_d(a) \mid (D, V), g_d \not \models \psi (x) \}\). Note that for each \(b \in B\), the set U(b) is a finite union of finite sets, and hence finite itself; it follows that \(U \le ^{\omega }_{B} V\). We claim that

    $$\begin{aligned} (D, U), g \models \varphi . \end{aligned}$$
    (9)

    It is clear that \((D, U), g \models \beta \) because \(\psi \) (and hence \(\beta \)) is B-free. To prove that \((D, U), g \models \forall x\, \alpha (x)\), we have to show that \((D, U), g_d \models \varphi '(x) \vee \psi (x)\) for any \(d \in D\). We consider two cases: If \((D, V), g_d \models \psi (x)\) we are done, again because \(\psi \) is B-free. On the other hand, if \((D, V), g_d \not \models \psi (x)\), then \((D, U_d), g_d \models \alpha (x)\) by assumption on \(U_{d}\), while it is obvious that \(U_{d} \le _{B} U\); but then it follows by monotonicity of \(\alpha \) that \((D, U), g_d \models \alpha (x)\).

This finishes the proof. \(\square \)

The second part of the proof of the theorem, is constituted by the following stronger version of the expressive completeness result that provides, as a corollary, normal forms for the syntactically continuous fragments.

Proposition 13

Let \(\texttt {L} \) be one of the logics \(\{ \texttt {M} , \texttt {M} \texttt {E} ^{\infty }\}\). There exists an effective translation \((-)^\ominus :\texttt {L} (A) \rightarrow \texttt {Con} _{B}(\texttt {L} (A))\) such that a sentence \({\varphi \in \texttt {L} (A)}\) is continuous in \(B \subseteq A\) if and only if \(\varphi \equiv \varphi ^\ominus \).

We prove the two manifestations of Proposition 13 separately, in two respective subsections.

By putting together the two propositions above, we are able to conclude.

Proof of Theorem 4

The first claim follows from Proposition 13. Hence, by applying Fact 1 to Proposition 13, the problem of checking whether a sentence \(\varphi \in \texttt {L} (A)\) is continuous in \(B \subseteq A\) or not, is decidable. \(\square \)

We conjecture that Proposition 13, and therefore Theorem 4, holds also for \(\texttt {L} =\texttt {M} \texttt {E} \).

5.1 Continuous fragment of \(\texttt {M} \)

Since continuity implies monotonicity, by Theorem 3, in order to verify the \(\texttt {M} \)-variant of Proposition 13, it is enough to prove the following result.

Proposition 14

There is an effective translation \((-)^\ominus :\texttt {Pos} _{B}(\texttt {M} (A)) \rightarrow \texttt {Con} _{B}(\texttt {M} (A))\) such that a sentence \(\varphi \in \texttt {Pos} _{B}(\texttt {M} (A))\) is continuous in \(B \subseteq A\) if and only if \(\varphi \equiv \varphi ^\ominus \).

Proof

By Corollary 1, to define the translation we may assume without loss of generality that \(\varphi \) is in the basic form \(\bigvee \nabla ^{B}_\texttt {M} (\Sigma )\). For the translation, let

$$\begin{aligned} \left( \bigvee \nabla ^{B}_\texttt {M} (\Sigma )\right) ^\ominus {:=}\bigvee \nabla ^{B}_\texttt {M} (\Sigma ,\Sigma ^{-}_{B}) \end{aligned}$$

where \(\Sigma ^{-}_{B} {:=}\{S\in \Sigma \mid B \cap S = \varnothing \}\). From the construction, it is clear that \(\varphi ^\ominus \in \texttt {Con} _{B}(\texttt {M} (A))\). Then the right-to-left direction of the proposition is immediate by Proposition 12.

For the left-to-right direction, assume that \(\varphi \) is continuous in B. We have to prove that \((D, V) \models \varphi \) iff \((D, V) \models \varphi ^\ominus \), for every monadic model (DV). Our proof strategy consists of proving the same equivalence for the model \((D\times \omega , V_\pi )\), where \(D\times \omega \) consists of \(\omega \) many copies of each element in D and \(V_\pi \) is the valuation given by \(V_{\pi }(a) {:=}\{(d,k) \mid d\in V(a), k\in \omega \}\). It is easy to see that \((D, V) \equiv ^{\texttt {M} } (D\times \omega , V_\pi )\) (see Proposition 18) and so it suffices indeed to prove that

$$\begin{aligned} (D\times \omega , V_\pi ) \models \varphi \text { iff } (D\times \omega , V_\pi ) \models \varphi ^\ominus . \end{aligned}$$

Consider first the case where \(D= \varnothing \). Then \((D\times \omega , V_\pi ) = {\mathbb {D}_\varnothing }\), and then the claim is true since \(\nabla ^{B}_\texttt {M} (\varnothing ) = \nabla ^{B}_\texttt {M} (\varnothing ,\varnothing ^{-}_{B})\) and \({\mathbb {D}_\varnothing }\models \nabla ^{B}_\texttt {M} (\Sigma )\) iff \(\Sigma =\varnothing \).

In the remainder of the proof we focus on the case where \(D \ne \varnothing \).

Let \((D\times \omega , V_\pi ) \models \varphi \). As \(\varphi \) is continuous in B there is a valuation \(U \le ^{\omega }_{B} V_\pi \) satisfying \((D\times \omega , U) \models \varphi \). This means that \((D\times \omega , U) \models \nabla ^{B}_\texttt {M} (\Sigma )\) for some disjunct \(\nabla ^{B}_\texttt {M} (\Sigma )\) of \(\varphi \). Below we will use the following fact (which can easily be verified):

$$\begin{aligned} (D\times \omega ),U \models \tau ^{B}_{S}(d,k) \text { iff } U^{\flat }(d,k) \setminus B = S \setminus B \text { and } U^{\flat }(d,k) \subseteq S \cap B. \end{aligned}$$
(10)

Our claim is now that \((D\times \omega , U) \models \nabla ^{B}_\texttt {M} (\Sigma ,\Sigma ^{-}_{B})\).

The existential part of \(\nabla ^{B}_\texttt {M} (\Sigma ,\Sigma ^{-}_{B})\) is trivially true. To cover the universal part, it remains to show that every element of \((D\times \omega , U)\) realizes a B-positive type in \(\Sigma ^{-}_{B}\). Take an arbitrary pair \((d,k) \in D\times \omega \) and let T be the (full) type of (dk), that is, let \(T {:=}U^{\flat }(d,k)\). If \(B \cap T = \varnothing \) then trivially \(T\in \Sigma ^{-}_{B}\) and we are done. So suppose \(B \cap T \ne \varnothing \). Observe that in \(D\times \omega \) we have infinitely many copies of \(d\in D\). Hence, as U(b) is finite for every \(b \in B\), there must be some \((d,k')\) with type \(U^{\flat }(d,k') = V_{\pi }^{\flat }(d,k') \setminus B = V_{\pi }^{\flat }(d,k) \setminus B = T \setminus B\). It follows from \((D\times \omega , U) \models \nabla ^{B}_\texttt {M} (\Sigma )\) and (10) that there is some \(S \in \Sigma \) such that \(S \setminus B = U^{\flat }(d,k') \setminus B = U^{\flat }(d,k')\) and \(S \cap B \subseteq U^{\flat }(d,k') \cap B = \varnothing \). From this we easily derive that \(S = U^{\flat }(d,k')\) and \(S \in \Sigma ^{-}_{B}\). Finally, we observe that \(S \setminus B = U^{\flat }(d,k') \setminus B = U^{\flat }(d,k) \setminus B\) and \(S \cap B = \varnothing \subseteq U^{\flat }(d,k)\), so that by (10) we find that \(D \times \omega ,U) \models \tau ^{B}_{S}(d,k)\) indeed.

Finally, by monotonicity it directly follows from \((D\times \omega , U) \models \nabla ^{B}_\texttt {M} (\Sigma ,\Sigma ^{-}_{B})\) that \((D\times \omega , V_{\pi }) \models \nabla ^{B}_\texttt {M} (\Sigma ,\Sigma ^{-}_{B})\), and from this it is immediate that \((D\times \omega , V_\pi ) \models \varphi ^\ominus \).

Let \((D\times \omega , V_\pi ) \models \nabla ^{B}_\texttt {M} (\Sigma ,\Sigma ^{-}_{B})\). To show that \((D\times \omega , V_\pi ) \models \nabla ^{B}_\texttt {M} (\Sigma )\), the existential part is trivial. For the universal part just observe that \(\Sigma ^{-}_{B} \subseteq \Sigma \). \(\square \)

A careful analysis of the translation provides us with normal forms for the continuous fragment of \(\texttt {M} \). We also formulate a version of this result which holds when we restrict to the positive fragment of \(\texttt {M} \); this version, which can be proved in the same manner as the main result, will be needed in our companion paper.

Corollary 4

For any sentence \(\varphi \in \texttt {M} (A)\), the following hold.

  1. 1.

    The formula \(\varphi \) is continuous in \(B \subseteq A\) iff it is equivalent to a formula \(\bigvee \nabla ^{B}_\texttt {M} (\Sigma ,\Sigma ^{-}_{B})\) for some types \(\Sigma \subseteq \wp (A)\), where \(\Sigma ^{-}_{B} {:=}\{S\in \Sigma \mid B \cap S = \varnothing \}\).

  2. 2.

    If \(\varphi \) positive in A (i.e., \(\varphi \in {\texttt {M} }^+(A)\)) then \(\varphi \) is continuous in \(B \subseteq A\) iff it is equivalent to a formula in the basic form \(\bigvee \nabla ^+_{\texttt {M} }(\Sigma ,\Sigma ^{-}_{B})\) for some types \(\Sigma \subseteq \wp (A)\), where \(\Sigma ^{-}_{B} {:=}\{S\in \Sigma \mid B \cap S = \varnothing \}\).

5.2 Continuous fragment of \(\texttt {M} \texttt {E} ^{\infty }\)

As for the previous case, the \(\texttt {M} \texttt {E} ^{\infty }\)-variant of Proposition 13 is an immediate consequence of Theorem 3 and the following proposition.

Proposition 15

There is an effective translation \((-)^\ominus :\texttt {Pos} _{B}(\texttt {M} \texttt {E} ^{\infty }(A)) \rightarrow \texttt {Con} _{B}(\texttt {M} \texttt {E} ^{\infty }(A))\) such that a sentence \(\varphi \in \texttt {Pos} _{B}(\texttt {M} \texttt {E} ^{\infty }(A))\) is continuous in B if and only if \(\varphi \equiv \varphi ^\ominus \).

Proof

By Corollary 3, we may assume that \(\varphi \) is in basic normal form, i.e., \(\varphi = \bigvee \nabla ^{B}_{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma )\), with \(\Sigma \subseteq \Pi \subseteq \overline{{\mathbf {T}}}\). For the translation let \(\big (\bigvee \nabla ^{B}_{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma )\big )^\ominus {:=}\bigvee \nabla ^{B}_{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma )^\ominus \) where

$$\begin{aligned} \nabla ^{B}_{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma )^\ominus {:=}{\left\{ \begin{array}{ll} \nabla ^{B}_{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma ) &{} \text { if } B \cap \bigcup \Sigma = \varnothing \\ \bot &{}\text { if } B \cap \bigcup \Sigma \ne \varnothing . \end{array}\right. } \end{aligned}$$

First we prove the right-to-left direction of the proposition. By Proposition 12 it is enough to show that \(\varphi ^\ominus \in \texttt {Con} _{B}(\texttt {M} \texttt {E} ^{\infty }(A))\). We focus on the disjuncts of \(\varphi ^\ominus \). The interesting case is where \(B \cap \bigcup \Sigma = \varnothing \). Define the formulas \(\varphi '(\overline{{\mathbf {x}}},z)\) and \(\psi (z)\) as follows:

$$\begin{aligned} \begin{array}{lll} \varphi '(\overline{{\mathbf {x}}},z) &{} {:=}&{} \lnot \text {diff}(\overline{{\mathbf {x}}},z) \vee \bigvee _{S\in \Pi \setminus \Sigma } \tau ^B_S(z)\\ \psi (z) &{} {:=}&{} \bigvee _{S\in \Sigma } \tau ^B_S(y). \end{array} \end{aligned}$$

Then we may rearrange the internal structure of the formula \(\nabla ^{B}_{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma )\) somewhat, arriving at the following:

$$\begin{aligned}&\exists \overline{{\mathbf {x}}}.\Big ( \text {diff}(\overline{{\mathbf {x}}}) \wedge \bigwedge _i \tau ^B_{T_i}(x_i)\ \wedge \forall z.(\underbrace{\lnot \text {diff}(\overline{{\mathbf {x}}},z) \vee \bigvee _{S\in \Pi \setminus \Sigma } \tau ^B_S(z)}_{\varphi '(\overline{{\mathbf {x}}},z)} \vee \underbrace{\bigvee _{S\in \Sigma } \tau ^B_S(z)}_{\psi (z)})\\&\quad \wedge \forall ^\infty y.\underbrace{\bigvee _{S\in \Sigma } \tau ^B_S(y)}_{\psi (y)} \Big ) \wedge \bigwedge _{S\in \Sigma } \exists ^\infty y.\tau ^B_S(y), \end{aligned}$$

so that we find

$$\begin{aligned} \nabla ^{B}_{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma ) \equiv \exists \overline{{\mathbf {x}}}.\Big (\text {diff}(\overline{{\mathbf {x}}}) \wedge \bigwedge _i \tau ^B_{T_i}(x_i) \wedge \mathbf {W} z.(\varphi '(\overline{{\mathbf {x}}},z),\psi (z)) \Big ) \wedge \bigwedge _{S\in \Sigma } \exists ^\infty y.\tau ^B_S(y), \end{aligned}$$

which belongs to the required fragment because \(B \cap \bigcup \Sigma = \varnothing \).

For the left-to-right direction of the proposition, we have to prove that \(\varphi \equiv \varphi ^\ominus \).

Let \((D, V) \models \varphi \). Because \(\varphi \) is continuous in B we may assume that V(b) is finite, for all \(b \in B\). Let \(\nabla ^{B}_{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma )\) be a disjunct of \(\varphi \) such that \((D, V) \models \nabla ^{B}_{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma )\). If \(D = \varnothing \), then \( {\overline{{\mathbf {T}}}}={\Pi }={\Sigma }=\varnothing \), and \(\nabla ^{B}_{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma ) = (\nabla ^{B}_{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma ))^\ominus \). Hence, let \(D \ne \varnothing \). Suppose for contradiction that \(B \cap \bigcup \Sigma \ne \varnothing \), then there must be some \(S\in \Sigma \) with \(B \cap S \ne \varnothing \). Because \((D, V) \models \nabla ^{B}_{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma )\) we have, in particular, that \((D, V) \models \exists ^\infty x.\tau ^B_S(x)\) and hence V(b) must be infinite, for any \(b \in B \cap S\), which is absurd. It follows that \(B \cap \bigcup \Sigma = \varnothing \), but then we trivially conclude that \((D, V) \models \varphi ^\ominus \) because the disjunct remains unchanged.

Let \((D, V) \models \varphi ^\ominus \). This direction is trivial, because the only difference between \(\varphi \) and \(\varphi ^\ominus \) is that some disjuncts may have been replaced by \(\bot \). \(\square \)

We conclude the section by stating the following corollary, providing normal forms for the continuous fragment of \(\texttt {M} \texttt {E} ^{\infty }\). As in the case of \(\texttt {M} \) we formulate, for future reference, a variation of this result which applies to the positive fragment of \(\texttt {M} \texttt {E} ^{\infty }\).

Corollary 5

For any sentence \(\varphi \in \texttt {M} \texttt {E} ^{\infty }(A)\), the following hold.

  1. 1.

    The formula \(\varphi \) is continuous in \(B \subseteq A\) iff \(\varphi \) is equivalent to a formula, effectively obtainable from \(\varphi \), which is a disjunction of formulas \(\nabla ^{B}_{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma )\) where \(\Sigma , \Pi \subseteq \wp (A)\) and \(\overline{{\mathbf {T}}} \in \wp (A)^k\) are such that \(\Sigma \subseteq \Pi \subseteq \overline{{\mathbf {T}}}\) and \(B \cap \bigcup \Sigma = \varnothing \).

  2. 2.

    If \(\varphi \) is positive (i.e., \(\varphi \in {\texttt {M} \texttt {E} ^{\infty }}^+(A)\)) then \(\varphi \) is continuous in \(B \subseteq A\) iff it is equivalent to a formula, effectively obtainable from \(\varphi \), which is a disjunction of formulas \(\bigvee \nabla ^+_{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma )\), where \(\Sigma , \Pi \subseteq \wp (A)\) and \(\overline{{\mathbf {T}}} \in \wp (A)^k\) are such that \(\Sigma \subseteq \Pi \subseteq \overline{{\mathbf {T}}}\) and \(B \cap \bigcup \Sigma = \varnothing \).

6 Submodels and quotients

There are various natural notions of morphism between monadic models; the one that we will be interested here is that of a (strong) homomorphism.

Definition 25

Let \(\mathbb {D}= (D,V)\) and \(\mathbb {D}' = (D',V')\) be two monadic models. A map \(f: D \rightarrow D'\) is a homomorphism from \(\mathbb {D}\) to \(\mathbb {D}'\), notation: \(f: \mathbb {D}\rightarrow \mathbb {D}'\), if we have \(d \in V(a)\) iff \(f(d) \in V'(a)\), for all \(a \in A\) and \(d \in D\).

In this section we will be interested in the sentences of \(\texttt {M} , \texttt {M} \texttt {E} \) and \(\texttt {M} \texttt {E} ^{\infty }\) that are preserved under taking submodels and the ones that are invariant under quotients.

Definition 26

Let \(\mathbb {D}= (D,V)\) and \(\mathbb {D}' = (D',V')\) be two monadic models. We call \(\mathbb {D}\) a submodel of \(\mathbb {D}'\) if \(D \subseteq D'\) and the inclusion map \(\iota _{DD'}: D \hookrightarrow D'\) is a homomorphism, and we say that \(\mathbb {D}'\) is a quotient of \(\mathbb {D}\) if there is a surjective homomorphism \(f: \mathbb {D}\rightarrow \mathbb {D}'\).

Now let \(\varphi \) be an \(\texttt {L} \)-sentence, where \(\texttt {L} \in \{ \texttt {M} , \texttt {M} \texttt {E} , \texttt {M} \texttt {E} ^{\infty }\}\). We say that \(\varphi \) is preserved under taking submodels if \(\mathbb {D}\models \varphi \) implies \(\mathbb {D}' \models \varphi \), whenever \(\mathbb {D}'\) is a submodel of \(\mathbb {D}\). Similarly, \(\varphi \) is invariant under taking quotients if we have \(\mathbb {D}\models \varphi \) iff \(\mathbb {D}' \models \varphi \), whenever \(\mathbb {D}'\) is a quotient of \(\mathbb {D}\).

The first of these properties (preservation under taking submodels) is well known from classical model theory—it is for instance the topic of the Łos-Tarski Theorem. When it comes to quotients, in model theory one is usually more interested in the formulas that are preserved under surjective homomorphisms (and the definition of homomorphism may also differ from ours). For instance, this is the topic of Lyndon’s Theorem [23] which characterises the formulas that are preserved under a weaker notion of homomorphism as the ones that are positive in all predicates occurring in the formula. Our preference for the notion of invariance under quotients stems from the fact that the property of invariance under quotients plays a key role in characterising the bisimulation-invariant fragments of various monadic second-order logics, as is explained in our companion paper [10].

6.1 Preservation under submodels

In this subsection we characterise the fragments of our predicate logics consisting of the sentences that are preserved under taking submodels. That is, the main result of this subsection is a Łos-Tarksi Theorem for \(\texttt {M} \texttt {E} ^{\infty }\).

Definition 27

The universal fragment of the set \(\texttt {M} \texttt {E} ^{\infty }(A)\) is the collection \(\texttt {Univ} (\texttt {M} \texttt {E} ^{\infty }(A))\) of formulas given by the following grammar:

$$\begin{aligned} \varphi \mathrel {::=}\top \mid \bot \mid a(x) \mid \lnot a(x) \mid x \approx y \mid x \not \approx y \mid (\varphi \vee \varphi ) \mid (\varphi \wedge \varphi ) \mid \forall x.\varphi \mid \forall ^\infty x.\varphi \end{aligned}$$

where \(x,y\in \mathsf {iVar}\) and \(a \in A\). The universal fragment \(\texttt {Univ} (\texttt {M} \texttt {E} (A))\) is obtained by deleting the clause for \(\forall ^\infty \) from this grammar, and we obtain the universal fragment \(\texttt {Univ} (\texttt {M} (A))\) by further deleting both clauses involving the equality symbol.

Theorem 5

Let \(\varphi \) be a sentence of the monadic logic \(\texttt {L} (A)\), where \(\texttt {L} \in \{ \texttt {M} , \texttt {M} \texttt {E} , \texttt {M} \texttt {E} ^{\infty }\}\). Then \(\varphi \) is preserved under taking submodels if and only if there is an equivalent formula \(\varphi ^{\otimes } \in \texttt {Univ} (\texttt {L} (A))\). Furthermore, it is decidable whether a sentence \(\varphi \in \texttt {L} (A)\) has this property or not.

We start by verifying that universal formulas satisfy the property.

Proposition 16

Let \(\varphi \in \texttt {Univ} (\texttt {L} (A))\) be a universal sentence of the monadic logic \(\texttt {L} (A)\), where \(\texttt {L} \in \{ \texttt {M} , \texttt {M} \texttt {E} , \texttt {M} \texttt {E} ^{\infty }\}\). Then \(\varphi \) is preserved under taking submodels.

Proof

It is enough to directly consider the case \(\texttt {L} = \texttt {M} \texttt {E} ^{\infty }\). Let \((D',V')\) be a submodel of the monadic model (DV). The case for \(D = \varnothing \) being immediate, let us assume \(D \ne \varnothing \). By induction on the complexity of a formula \(\varphi \in \texttt {Univ} (\texttt {M} \texttt {E} ^{\infty }(A))\) we will show that for any assignment \(g: \mathsf {iVar}\rightarrow D'\) we have

$$\begin{aligned} (D,V),g' \models \varphi \text { implies } (D',V'),g \models \varphi , \end{aligned}$$

where \(g':= g \circ \iota _{D'D}\). We will only consider the inductive step of the proof where \(\varphi \) is of the form \(\forall ^\infty x. \psi \). Define \(X_{D,V} {:=}\{ d \in D \mid (D,V), g'[x \mapsto d] \models \psi \}\), and similarly, \(X_{D',V'} {:=}\{ d \in D' \mid (D',V'), g[x \mapsto d] \models \psi \}\). By the inductive hypothesis we have that \(X_{D,V} \cap D' \subseteq X_{D',V'}\), implying that \(D' \setminus X_{D',V'} \subseteq D \setminus X_{D,V}\). But from this we immediately obtain that

$$\begin{aligned} |D \setminus X_{D,V}|< \omega \text { implies } |D' \setminus X_{D',V'}| < \omega , \end{aligned}$$

which means that \((D,V),g' \models \varphi \) implies \( (D',V'),g \models \varphi \), as required. \(\square \)

Turning to the much harder verification of the opposite implication of the theorem, we first define the appropriate translations from each monadic logic into its universal fragment.

Definition 28

We start by defining the translations for sentences in basic normal forms. Let \(\Sigma , \Pi \subseteq \wp (A)\) be some types and \(\overline{{\mathbf {T}}} \in \wp (A)^k\) some list of types. For \(\texttt {M} \)-sentences in basic form we first set

$$\begin{aligned} \Big ( \nabla _{\texttt {M} }(\Sigma ) \Big )^{\otimes } {:=}\forall z \bigvee _{S \in \Sigma } \tau _{S}(z), \end{aligned}$$

in the case of \(\texttt {M} \texttt {E} \) we define

$$\begin{aligned} \Big ( \nabla _{\texttt {M} \texttt {E} }(\overline{{\mathbf {T}}},\Pi ) \Big )^{\otimes } {:=}\forall z \bigvee _{S \in \overline{{\mathbf {T}}}\cup \Pi } \tau _{S}(z), \end{aligned}$$

while for basic formulas of \(\texttt {M} \texttt {E} ^{\infty }\), the translation \((-)^\otimes \) is given as follows:

$$\begin{aligned} (\nabla _{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma ))^{\otimes } {:=}\forall z \bigvee _{S \in \overline{{\mathbf {T}}}\cup \Pi } \tau _{S}(z) \wedge \forall ^\infty z \bigvee _{S \in \Sigma } \tau _{S}(z). \end{aligned}$$

Second, in each case we define \((\bigvee _{i} \varphi _{i})^{\otimes } {:=}\bigvee \varphi _{i}^{\otimes }\).

Finally, for each \(\texttt {L} \in \{ \texttt {M} , \texttt {M} \texttt {E} , \texttt {M} \texttt {E} ^{\infty }\}\), we extend the translation \((-)^{\otimes }\) to the collection of all sentences by defining \(\varphi ^{\otimes } {:=}(\varphi ^{*})^{\otimes }\), where \(\varphi ^{*}\) is the basic normal form of \(\varphi \) as given by Fact 2 (in the case of \(\texttt {M} \)), by Theorem 1 (in the case of \(\texttt {M} \texttt {E} \)), and by Theorem 2 (in the case of \(\texttt {M} \texttt {E} ^{\infty }\)).

The missing part in the proof of the theorem is covered by the following result.

Proposition 17

For any monadic logic \(\texttt {L} \in \{ \texttt {M} , \texttt {M} \texttt {E} , \texttt {M} \texttt {E} ^{\infty }\}\) there is an effective translation \((-)^{\otimes } : \texttt {L} (A) \rightarrow \texttt {Univ} (\texttt {L} (A))\) such that a sentence \(\varphi \in \texttt {L} (A)\) is preserved under taking submodels if and only if \(\varphi \equiv \varphi ^{\otimes }\).

Proof

We only consider the case where \(\texttt {L} = \texttt {M} \texttt {E} ^{\infty }\), leaving the other cases to the reader.

It is easy to see that \(\varphi ^{\otimes } \in \texttt {Univ} (\texttt {M} \texttt {E} ^{\infty }(A))\), for every sentence \(\varphi \in \texttt {M} \texttt {E} ^{\infty }(A)\); but then it is immediate by Proposition 16 that \(\varphi \) is preserved under taking submodels if \(\varphi \equiv \varphi ^{\otimes }\).

For the left-to-right direction, assume that \(\varphi \) is preserved under taking submodels. It is easy to see that \(\varphi \) implies \(\varphi ^{\otimes }\), so we focus on proving the opposite. That is, we suppose that \((D,V) \models \varphi ^{\otimes }\), and aim to show that \((D,V) \models \varphi \).

By Theorem 2 we may assume without loss of generality that \(\varphi \) is a disjunction of sentences of the form \(\nabla _{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma )\), where \(\Sigma \subseteq \Pi \subseteq \overline{{\mathbf {T}}}\). It follows that (DV) satisfies some disjunct \( \forall z \bigvee _{S \in \overline{{\mathbf {T}}}\cup \Pi } \tau _{S}(z) \wedge \forall ^\infty z \bigvee _{S \in \Sigma } \tau _{S}(z)\) of \(\Big (\nabla _{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma )\Big )^{\otimes }\). Expand D with finitely many elements \(\overline{{\mathbf {d}}}\), in one-one correspondence with \(\overline{{\mathbf {T}}}\), and ensure that the type of each \(d_{i}\) is \(T_{i}\). In addition, add, for each \(S \in \Sigma \), infinitely many elements \(\{ e^{S}_{n} \mid n \in \omega \}\), each of type S. Call the resulting monadic model \(\mathbb {D}' = (D',V')\).

This construction is tailored to ensure that \((D',V') \models \nabla _{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma )\), and so we obtain \((D',V') \models \varphi \). But obviously, \(\mathbb {D}\) is a submodel of \(\mathbb {D}'\). This implies that \((D,V) \models \varphi \), by our assumption on \(\varphi \). \(\square \)

Proof of Theorem 5

The first part of the theorem is an immediate consequence of Proposition 17. By applying Fact 1 to Proposition 17 we finally obtain that for the three concerned formalisms the problem of deciding whether a sentence is preserved under taking submodels is decidable. \(\square \)

As an immediate consequence of the proof of the previous Proposition 17, we get effective normal forms for the universal fragments.

Corollary 6

The following hold:

  1. 1.

    A sentence \(\varphi \in \texttt {M} \texttt {E} (A)\) is preserved under taking submodels iff it is equivalent to a formula \(\bigvee \big ( \forall z \bigvee _{S \in \Sigma } \tau _{S}(z)\big )\), for types \(\Sigma \subseteq \wp (A)\).

  2. 2.

    A sentence \(\varphi \in \texttt {M} \texttt {E} (A)\) is preserved under taking submodels iff it is equivalent to a formula \(\bigvee \big ( \forall z \bigvee _{S \in \overline{{\mathbf {T}}}\cup \Pi } \tau _{S}(z)\big )\), for types \(\Pi \subseteq \wp (A)\) and \(\overline{{\mathbf {T}}} \in \wp (A)^k\) for some k.

  3. 3.

    A sentence \(\varphi \in \texttt {M} \texttt {E} ^{\infty }(A)\) is preserved under taking submodels iff it is equivalent to a formula \(\bigvee \big (\forall z \bigvee _{S \in \overline{{\mathbf {T}}}\cup \Pi } \tau _{S}(z) \wedge \forall ^\infty z \bigvee _{S \in \Sigma } \tau _{S}(z)\big )\) for types \(\Sigma \subseteq \Pi \subseteq \wp (A)\) and \(\overline{{\mathbf {T}}} \in \wp (A)^k\) for some k.

In all three cases, normal forms are effective.

6.2 Invariance under quotients

The following theorem states that monadic first-order logic without equality (\(\texttt {M} \)) provides the quotient-invariant fragment of both monadic first-order logic with equality (\(\texttt {M} \texttt {E} \)), and of infinite-monadic predicate logic (\(\texttt {M} \texttt {E} ^{\infty }\)). Recall that a formula \(\varphi \) is invariant under taking quotients if it satisfies the condition that \(\mathbb {D}\models \varphi \) iff \(\mathbb {D}' \models \varphi \), for any monadic model \(\mathbb {D}\) and any quotient \(\mathbb {D}'\) of \(\mathbb {D}\).

Theorem 6

Let \(\varphi \) be a sentence of the monadic logic \(\texttt {L} (A)\), where \(\texttt {L} \in \{ \texttt {M} \texttt {E} , \texttt {M} \texttt {E} ^{\infty }\}\). Then \(\varphi \) is invariant under taking quotients if and only if there is an equivalent sentence in \(\texttt {M} \). Furthermore, it is decidable whether a sentence \(\varphi \in \texttt {L} (A)\) has this property or not.

We first state the ‘easy’ part of the first claim of the theorem. Note that in fact, we have already been using this observation in earlier parts of the paper.

Proposition 18

Every sentence in \(\texttt {M} \) is invariant under taking quotients.

Proof

Let \(f: D \rightarrow D'\) provide a surjective homomorphism between the models (DV) and \((D',V')\), and observe that for any assignment \(g: \mathsf {iVar}\rightarrow D\) on D, the composition \(f \circ g: \mathsf {iVar}\rightarrow D'\) is an assignment on \(D'\).

In order to prove the proposition one may show that, for an arbitrary \(\texttt {M} \)-formula \(\varphi \) and an arbitrary assignment \(g: \mathsf {iVar}\rightarrow D\), we have

$$\begin{aligned} (D,V),g \models \varphi \text { iff } (D',V'), f \circ g \models \varphi . \end{aligned}$$
(11)

We leave the proof of (11), which proceeds by a straightforward induction on the complexity of \(\varphi \), as an exercise to the reader. \(\square \)

To prove the remaining part of Theorem 6, we start with providing translations from \(\texttt {M} \texttt {E} \) and from \(\texttt {M} \texttt {E} ^{\infty }\), respectively, to \(\texttt {M} \).

Definition 29

For \(\texttt {M} \texttt {E} \)-sentences in basic form we first define

$$\begin{aligned} \Big ( \nabla _{\texttt {M} \texttt {E} }(\overline{{\mathbf {T}}},\Pi ) \Big )^{\circ } {:=}\bigwedge _{i} \exists x_i. \tau _{T_i}(x_i) \wedge \forall x. \bigvee _{S\in \Pi } \tau _S(x), \end{aligned}$$

whereas for \(\texttt {M} \texttt {E} ^{\infty }\)-sentences in basic form we start with defining

$$\begin{aligned} \Big ( \nabla _{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma ) \Big )^{\bullet } {:=}\bigwedge _{i} \exists x_i. \tau _{T_i}(x_i) \wedge \forall x. \bigvee _{S\in \Sigma } \tau _S(x). \end{aligned}$$

In both cases, the translation is then extended to the full language as in Definition 28.

Note that the two maps may give different translations for \(\texttt {M} \texttt {E} \)-sentences. Also observe that the \(\Pi \) ‘disappears’ in the translation of the formula \(\nabla _{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma )\).

The key property of these translations is the following.

Proposition 19

  1. 1.

    For every monadic model (DV) and every \(\varphi \in \texttt {M} \texttt {E} (A)\) we have

    $$\begin{aligned} (D,V) \models \varphi ^{\circ } \text { iff } (D\times \omega ,V_\pi ) \models \varphi . \end{aligned}$$
    (12)
  2. 2.

    For every monadic model (DV) and every \(\varphi \in \texttt {M} \texttt {E} ^{\infty }(A)\) we have

    $$\begin{aligned} (D,V) \models \varphi ^{\bullet } \text { iff } (D\times \omega ,V_\pi ) \models \varphi . \end{aligned}$$
    (13)

Here \(V_{\pi }\) is the induced valuation given by \(V_{\pi }(a) {:=}\{ (d,k) \mid d \in V(a), k\in \omega \}\).

Proof

We only prove the claim for \(\texttt {M} \texttt {E} ^{\infty }\) (i.e., the second part of the proposition), the case for \(\texttt {M} \texttt {E} \) being similar. Clearly it suffices to prove (13) for formulas of the form \(\varphi = \nabla _{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma )\).

First of all, if \(\mathbb {D}\) is the empty model, we find \({\overline{{\mathbf {T}}}}={\Pi } = {\Sigma } = \varnothing \), \((D, V) = (D\times \omega , V_\pi )\), and \(\nabla _{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma ) = (\nabla _{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma ))^\bullet \). In other words, in this case there is nothing to prove.

In the sequel we assume that \(D \ne \varnothing \).

Assume \((D, V) \models \varphi ^{\bullet }\), we will show that \((D\times \omega , V_\pi ) \models \nabla _{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma )\). Let \(d_i\) be such that \(V^{\flat }(d_i) = T_{i}\) in (DV). It is clear that the \((d_i,i)\) provide distinct elements, with each \((d_i,i)\) satisfying \(\tau _{T_i}\) in \((D\times \omega , V_{\pi })\). Thus, the first-order existential part of \(\varphi \) is satisfied. With a similar argument it is straightforward to verify that the \(\exists ^\infty \)-part of \(\varphi \) is also satisfied—here we critically use the observation that \(\Sigma \subseteq \overline{{\mathbf {T}}}\), so that every type in \(\Sigma \) is witnessed in the model (DV), and hence witnessed infinitely many times in \((D\times \omega , V_\pi )\).

For the universal parts of \(\nabla _{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma )\) it is enough to observe that, because of the universal part of \(\varphi ^\bullet \), every \(d\in D\) realizes a type in \(\Sigma \). By construction, the same applies to \((D\times \omega , V_{\pi })\). This takes care of both universal quantifiers.

Assuming that \((D\times \omega , V_\pi ) \models \nabla _{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma )\), we will show that \((D, V) \models \varphi ^\bullet \). The existential part of \(\varphi ^{\bullet }\) is trivial. For the universal part we have to show that every element of D realizes a type in \(\Sigma \). Suppose not, and let \(d\in D\) be such that \(\lnot \tau _S(d)\) for all \(S\in \Sigma \). Then we have \((D\times \omega , V_\pi ) \not \models \tau _S(d,k)\) for all k. That is, there are infinitely many elements not realising any type in \(\Sigma \). Hence we have \((D\times \omega , V_\pi ) \not \models \forall ^\infty y.\bigvee _{S\in \Sigma } \tau _S(y)\). Absurd, because this formula is a conjunct of \(\nabla _{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma )\). \(\square \)

We will now show how the theorem follows from this. First of all we verify that in both cases \(\texttt {M} \) is expressively complete for the property of being invariant under taking quotients.

Proposition 20

For any monadic logic \(\texttt {L} \in \{ \texttt {M} \texttt {E} , \texttt {M} \texttt {E} ^{\infty }\}\) there is an effective translation such that a sentence \(\varphi \in \texttt {L} (A)\) is invariant under taking quotients if and only if .

Proof

Let \(\varphi \) be a sentence of \(\texttt {M} \texttt {E} ^{\infty }\), and let (we only cover the case of \(\texttt {L} = \texttt {M} \texttt {E} ^{\infty }\), the case for \(\texttt {L} = \texttt {M} \texttt {E} \) is similar, just take ) We will show that

(14)

The direction from right to left is immediate by Proposition 18. For the other direction it suffices to observe that any model (DV) is a quotient of its ‘\(\omega \)-product’ \((D\times \omega , V_\pi )\), and to reason as follows:

$$\begin{aligned} (D,V) \models \varphi&\text { iff } (D\times \omega , V_\pi ) \models \varphi&(\text {assumption on }\varphi ) \\&\text { iff } (D,V) \models \varphi ^{\bullet }&(\text {Proposition }19) \end{aligned}$$

\(\square \)

Hence we can conclude.

Proof of Theorem 6

The theorem is an immediate consequence of Proposition 20. Finally, the effectiveness of translation \((\cdot )^{\bullet }\), decidability of \(\texttt {M} \texttt {E} ^{\infty }\) (Fact 1) and (14) yield that it is decidable whether a given \(\texttt {M} \texttt {E} ^{\infty }\)-sentence \(\varphi \) is invariant under taking quotients or not. \(\square \)

As a corollary, we obtain:

Corollary 7

Let \(\varphi \) be a sentence of the monadic logic \(\texttt {L} (A)\), where \(\texttt {L} \in \{ \texttt {M} \texttt {E} , \texttt {M} \texttt {E} ^{\infty }\}\). Then \(\varphi \) is invariant under taking quotients if and only if there is an equivalent sentence \(\nabla _{\texttt {M} }(\Sigma )\) for types \(\Sigma \subseteq \wp (A)\). Moreover, such a normal form is effective.

In our companion paper [10] on automata, we need versions of these results for the monotone and the continuous fragment. For this purpose we define some slight modifications of the translations \((\cdot )^{\circ }\) and \((\cdot )^{\bullet }\) which restricts to positive and syntactically continuous sentences.

Theorem 7

There are effective translations \((\cdot )^{\circ }: \texttt {M} \texttt {E} ^+ \rightarrow \texttt {M} ^{+} \) and \((\cdot )^{\bullet }: {\texttt {M} \texttt {E} ^{\infty }}^+ \rightarrow \texttt {M} ^+\) such that \(\varphi \equiv \varphi ^{\circ }\) (respectively, \(\varphi \equiv \varphi ^{\bullet }\)) iff \(\varphi \) is invariant under quotients. Moreover, we may assume that \((\cdot )^{\bullet }: \texttt {Con} _{B}({\texttt {M} \texttt {E} ^{\infty }}(A))\cap {\texttt {M} \texttt {E} ^{\infty }}^+ \rightarrow \texttt {Con} _{B}(\texttt {M} (A)) \cap \texttt {M} ^+\), for any \(B \subseteq A\).

Proof

We define translations \((\cdot )^{\circ }:\texttt {M} \texttt {E} ^+ \rightarrow \texttt {M} ^+\) and \((\cdot )^{\bullet }: {\texttt {M} \texttt {E} ^{\infty }}^+ \rightarrow \texttt {M} ^+\) as follows. For \(\texttt {M} \texttt {E} ^{+},{\texttt {M} \texttt {E} ^{\infty }}^{+}\)-sentences in simple basic form we define

$$\begin{aligned} \begin{array}{lll} \Big ( \nabla ^+_{\texttt {M} \texttt {E} }(\overline{{\mathbf {T}}},\Pi ) \Big )^{\circ } &{} {:=}&{} \bigwedge _{i} \exists x_i. \tau ^+_{T_i}(x_i) \wedge \forall x. \bigvee _{S\in \Pi } \tau ^+_S(x),\\ \Big ( \nabla ^+_{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma ) \Big )^{\bullet } &{} {:=}&{} \bigwedge _{i} \exists x_i. \tau ^+_{T_i}(x_i) \wedge \forall x. \bigvee _{S\in \Sigma } \tau ^+_S(x), \end{array} \end{aligned}$$

and then we use, respectively, the Corollaries 2 and 3 to extend these translations to the full positive fragments \(\texttt {M} \texttt {E} ^{+}\) and \({\texttt {M} \texttt {E} ^{\infty }}^{+}\), as we did in Definition 29 for the full language.

We leave it as an exercise for the reader to prove the analogue of Proposition 19 for these translations, and to show how the first statements of the theorem follows from this.

Finally, to see why we may assume that \((\cdot )^{\bullet }\) restricts to a map from the syntactically B-continuous fragment of \({\texttt {M} \texttt {E} ^{\infty }}^+(A)\) to the syntactically B-continuous fragment of \({\texttt {M} }^+(A)\), assume that \(\varphi \in \texttt {M} \texttt {E} ^{\infty }(A)\) is continuous in \(B \subseteq A\). By Corollary 5 we may assume that \(\varphi \) is a disjunction of formulas of the form \(\nabla ^+_{\texttt {M} \texttt {E} ^{\infty }}(\overline{{\mathbf {T}}},\Pi ,\Sigma )\), where \(B \cap \bigcup \Sigma = \varnothing \). This implies that in the formula \(\varphi ^{\bullet }\) no predicate symbol \(b \in B\) occurs in the scope of a universal quantifier, and so \(\varphi ^{\bullet }\) is syntactically continuous in B indeed. \(\square \)

7 Conclusions

In this paper we established some model-theoretic results about the logic \(\texttt {M} \texttt {E} ^{\infty }\), a variation of monadic first-order logic that features the generalised quantifier \(\exists ^\infty \) (‘there are infinitely many’), and about its classical fragments \(\texttt {M} \texttt {E} \) and \(\texttt {M} \) consisting of, respectively, monadid first-order logic with and without equality. For each logic \(\texttt {L} \in \{ \texttt {M} , \texttt {M} \texttt {E} , \texttt {M} \texttt {E} ^{\infty }\}\) we used the method of Ehrenfeucht–Fraïssé games to show that arbitrary sentences can be effectively rewritten into some normal form. We subsequently used these normal forms to prove a number of characterisation theorems, covering some well-known semantic properties, viz., monotonicity and preservation under submodels, but also some properties of more specific interest, viz., continuity and invariance under quotients. In all cases we actually proved a stronger result than a mere characterisation theorem: we provided a map, effectively translating arbitrary sentences into sentences of the required syntactic shape, and we showed that an arbitrary sentence in \(\texttt {L} \) has the semantic property under scrutiny iff it is equivalent to its translation. As a consequence of this result and the fact that each \(\texttt {L} \in \{ \texttt {M} , \texttt {M} \texttt {E} , \texttt {M} \texttt {E} ^{\infty }\}\) has a decidable satisfiability problem, we showed that each of the mentioned properties is decidable for monadic first-order sentences.

Our main interest concerned the language \(\texttt {M} \texttt {E} ^{\infty }\) with the infinity quantifer. Since this operator does not make sense in finite models, we did not explicitly investigate which of our results on the other languages, \(\texttt {M} \) and \(\texttt {M} \texttt {E} \), hold as well in the setting of finite model theory. We claim, however, that all of our results on normal forms, and on characterisations of monotonicity, preservation under submodels, and invariance under quotients, hold in this setting as well, with only minor adaptations of the proofs. (The remaining property of continuity is obviously not of interest in the setting of finite models.) For instance, some of our proofs use a model-theoretic copying construction that turns an arbitrary monadic model (DV) into its \(\omega \)-fold copy \((D\times \omega , V_{\pi })\). In the setting of finite model theory, this construction needs to be replaced with a more fine-grained k-fold copying construction, with k a finite number depending on the sentence under investigation.

We finish with mentioning some suggestions for further research. First, given that many semantic properties of monadic predicate logics turn out to be decidable, a natural follow-up question would be to investigate the computational complexity of these problems. Second, by van Benthem’s result (cf. Proposition 11), a sentence \(\varphi \in \texttt {M} \texttt {E} (A)\) is continuous in a set \(B \subseteq A\) if and only if it is equivalent to some \(\varphi ^{\ominus }\) in the syntactic fragment \(\texttt {Con} _{B}(\texttt {L} (A))\). Intrigueingly we did not manage to prove this result using the normal form method; we conjecture, however, that the obvious analogues of Proposition 13 and Theorem 4 do hold for \(\texttt {L} =\texttt {M} \texttt {E} \).

Finally, one perspective on our work is that it bears further witness to the fact that failure of compactness is in itself not an obstacle for the development of model theory—as is well known, of course, from the area of finite model theory that we just mentioned. It would be interesting to see which of our characterisation results still hold if we drop the restriction to monadic predicate logic, and investigate the full language \(\texttt {FOE} ^{\infty }\) of first-order logic (with equality) extended with the infinity quantifier \(\exists ^\infty \), or fragments of \(\texttt {FOE} ^{\infty }\) that are more expressive than \(\texttt {M} \texttt {E} ^{\infty }\). A first step in this direction was taken by Ignacio Bellas Acosta, who wrote, under the supervision of the third author, an MSc thesis [3] on the modal fragment of \(\texttt {FOE} ^{\infty }\) establishing, among other things, a van Benthem-style bisimulation invariance result.