1 Introduction

There are several areas of applied category theory, from Grothendieck descent theory to the study of higher-dimensional categorical structures, which use so-called cartesian monads and monads satisfying weaker conditions, where the relevant pullbacks are replaced with weak pullbacks. In particular, previous work of the first two authors [6, 7] extensively used the fact that the ultrafilter monad on the category Set of sets preserves weak pullbacks and its multiplication is weakly cartesian (that is, satisfying (BC) in the terminology of [4, 5]).

In this paper we consider several monads on \(\mathrm{\mathbf Set}\) that determine familiar varieties of universal algebras, and make an attempt of systematic study of weakly cartesian properties for such monads. Specifically, we are interested if such a monad \(T = (T,\eta ,\mu )\) on \(\mathrm{\mathbf Set}\) has the following properties:

  • the functor \(T\) preserves weak pullbacks;

  • the natural transformations \(\eta \) and \(\mu \) are weakly cartesian.

(Definition 2.1 below recalls what this means explicitly). The results, including known, new but obvious, and less obvious ones, are:

  • Observation 3.1, listing some known varieties determined by cartesian monads.

  • Theorem 3.3, which in fact gives wide classes of monads \(T = (T,\eta ,\mu )\), where \(T\) does not preserve weak pullbacks, and where \(\mu \) is not weakly cartesian—although we then consider only one specific example.

  • Proposition 4.4, which gives another wide class of monads \(T\), where the functor \(T\) does not preserve weak pullbacks; that class contain monads that determine varieties of groups, rings, modules and all kinds of algebras over rings, and, much more generally, all universal algebras that admit a very weak form of subtraction operation. Some intermediate levels of generality are also mentioned in the same section.

  • Theorems 6.4, 7.1, and 8.10, which give necessary and sufficient conditions on a semiring \(S\), under which the free \(S\)-semimodule monad satisfies any of the above-mentioned properties. In some sense the case of semimodules is the only case, found so far, where those properties are characterized in terms of non-trivial algebraic conditions rather than just either obviously hold or obviously fail. We therefore devoted the last section to consider semimodules over some particular semirings. Note also that considering varieties of semimodules is the same as considering arbitrary varieties, in which finite coproducts (=free products) canonically coincide with finite products (see e.g. [9]), or, equivalently, pointed varieties with binary biproducts.

The paper is divided, apart from this introduction, into eight sections, whose titles are self-explanatory:

  1. 2.

    Weakly cartesian monads

  2. 3.

    Some cartesian cases and a related negative result

  3. 4.

    Normal, 0-regular, subtractive, and weakly subtractive varieties

  4. 5.

    A condition that excludes cartesianness

  5. 6.

    Semimodules I: Preliminary remarks and weak cartesianness of the functor \(T\)

  6. 7.

    Semimodules II: Weak cartesianness for \(\eta \)

  7. 8.

    Semimodules III: Weak cartesianness of \(\mu \) and the main results

  8. 9.

    Semimodules IV: Examples.

2 Weakly cartesian monads

Recall that a commutative diagram

(2.1)

in a category is a weak pullback if for each pair \((u:P\rightarrow X, v:P\rightarrow Y)\) there exists a morphism \(t:P\rightarrow W\) (not necessarily unique) such that \(p\cdot t=u\) and \(q\cdot t=v\). If the category has pullbacks, (2.1) is a weak pullback if and only if the canonical map

(2.2)

is a split epimorphism. Therefore, in the category \(\mathrm{\mathbf Set}\) of sets, (2.1) is a weak pullback if and only if \(\langle p,q\rangle \) is surjective, that is, for every \(x\in X\) and \(y\in Y\) with \(f(x) = g(y)\), there exists \(w\in W\) with \(p(w) = x\) and \(q(w) = y\).

Definition 2.1

Given a monad \(T = (T,\eta ,\mu )\) on \(\mathrm{\mathbf Set}\), we say that:

  1. (a)

    the functor \(T\) is weakly cartesian if it preserves weak pullbacks, or, equivalently, transforms pullbacks into weak pullbacks.

  2. (b)

    the natural transformation \(\eta \) is weakly cartesian if, for every map \(u : A\rightarrow B\), the naturality square

    (2.3)

    is a weak pullback.

  3. (c)

    analogously, the natural transformation \(\mu \) is weakly cartesian if, for every map \(u:A\rightarrow B\), the naturality square

    (2.4)

    is a weak pullback.

Remarks 2.2

  1. (a)

    Replacing weak pullbacks with (ordinary) pullbacks in Definitions 2.1, we obtain the conditions that define cartesian monads. Moreover, every cartesian monad obviously satisfies the conditions of Definition 2.1. Monads satisfying these conditions will be called weakly cartesian, as it is done in [18, 24, 27].

  2. (b)

    The monads with weakly cartesian functor \(T\) (called also condition (BC) on the functor \(T\)) and weakly cartesian \(\mu \) (called condition (BC) on the natural transformation \(\mu \)) were considered many times, starting from [4], and including [8]. Weak cartesianness of \(\eta \) was not used in those papers since it does not hold for the ultrafilter monad, which was the main example there—except [7], where a special case was used. We will say that the monad \((T,\eta ,\mu )\) satisfies (BC) if \(T\) and \(\mu \) satisfy (BC).

  3. (c)

    As, e.g., Manes does in his papers [20, 21], we call a functor taut if it preserves pullbacks along monomorphisms, while a natural transformation is taut if the naturality squares determined by monomorphisms are pullbacks. Clearly every weakly cartesian functor is taut, and every weakly cartesian natural transformation is taut. A monad \(T=(T,\eta ,\mu )\) is taut if \(T\), \(\eta \) and \(\mu \) are taut. We also remark that Möbus introduced this type of monads under the name Alexandrov monad in [22].

Any variety \(\mathrm{\mathbf C}\) of universal algebras is equipped with the free-forgetful adjunction

(2.5)

and therefore has its corresponding monad \(T = (T,\eta ,\mu )\) on \(\mathrm{\mathbf Set}\), in which

$$\begin{aligned} T = UF, \;\eta \ \hbox { is as in} \ (2.5), \hbox {and }\mu =U\varepsilon F. \end{aligned}$$
(2.6)

We can also write

$$\begin{aligned} \mathrm{\mathbf Set}^T = \hbox {Alg} (\Omega ,\Phi ), \end{aligned}$$
(2.7)

where \((\Omega ,\Phi )\) is any presentation of \(\mathrm{\mathbf C}\) as the category of \(\Omega \)-algebras satisfying a set \(\Phi \) of identities.

In particular, there are two trivial cases:

Example 2.3

Suppose \(\Phi \) in (2.7) contains the identity \(x = y\), where \(x\) and \(y\) are distinct variables. Then any \(T\)-algebra has at most one element, and in particular the same is true for \(T(X)\), for any set \(X\). The monad \(T\) is then called trivial. In fact there are exactly two trivial monads on \(\mathrm{\mathbf Set}\), both of the form (2.6) for some variety \(\mathrm{\mathbf C}\), namely:

  1. (a)

    the monad \(T\) with \(T(X) = 1\) (one-element set) for every set \(X\)—this is the case if and only if the set \(\Omega _0\) of \(0\)-ary operators in any (equivalently, in every) presentation (2.7) is non-empty; and

  2. (b)

    the monad \(T\) with

    $$\begin{aligned} T(X)={\left\{ \begin{array}{ll}1,&{}\hbox { if }X\ne \emptyset ,\\ \emptyset ,&{}\hbox { if }X=\emptyset \end{array}\right. } \end{aligned}$$

    which is the case if and only if the set \(\Omega _0\) is empty.

In both cases it is easy to see that \(T\) satisfies (BC), but \(\eta \) is not weakly cartesian.

From now on we shall always assume that the monad \(T\) is non-trivial, or, equivalently, that \(\mathrm{\mathbf C}\) is non-trivial, that is, that \(\mathrm{\mathbf C}\) contains algebras of arbitrarily large cardinality. In particular this implies that every component of \(\eta \) is injective.

Remarks 2.4

  1. (a)

    Since every component of \(\eta \) is injective, weak cartesianness of \(\eta \) is equivalent to the requirement that every diagram of the form (2.3) is a pullback, that is, it is equivalent to cartesianness of \(\eta \). Hence, when \(T1\cong 1\), weak cartesianness of \(\eta \) gives that diagram (2.3) for \(u:X\rightarrow 1\) is a pullback, that is, \(\eta _X\) is an isomorphism for every \(X\), and so \(T\) is the identity monad.

  2. (b)

    In contrast to this and Proposition 2.3 of [21] we point out that, if \(T\) is non-trivial, \(\eta \) is taut. Indeed, consider our diagram (2.3) assuming that \(u\) is injective, and suppose \(\eta _B(b) = T(u)(t)\) for some \(b\in B\) and \(t\in T(A)\). We shall separately consider the case \(b\in u(A)\), and the case \(b\not \in u(A)\).

    • Case 1: \(b\in u(A)\). We have \(b = u(a)\) for some \(a\in A\). This gives:

      $$\begin{aligned} T(u)\eta _A(a) =\eta _Bu(a) =\eta _B(b) = T(u)(t) \end{aligned}$$

      and so \(\eta _A(a) = t\), since \(T(u)\) is injective.

    • Case 2: \(b\not \in u(A)\). Let \(C\) be an arbitrary non-empty \(T\)-algebra, and \(c\) and \(d\) elements in \(C\). Let \(f\) and \(g\) be any two maps from \(B\) to \(C\) with \(f(b) = c\), \(g(b) = d\), and \(f(x) = g(x)\) for every \(x\in u(A)\), and let \(\overline{f}\) and \(\overline{g}\) be \(T\)-algebra homomorphisms \(T(B)\rightarrow C\) induced by \(f\) and \(g\) respectively. We have:

      $$\begin{aligned} \begin{array}{rcll} c &{}=&{} f(b)&{}\hbox {(by the assumption on f)}\\ &{}=&{} \overline{f}\eta _B(b)&{}\hbox {(by definition of}\; \overline{f})\\ &{}=&{}\overline{f}T(u)(t)&{}\hbox {(by the assumption on}\; b\; \hbox {and}\; t)\\ &{}=&{}\overline{g}T(u)(t)&{}\hbox {(since} \overline{f}T(u),\overline{g}T(u):T(A)\rightarrow C\; \hbox {are}\; T\hbox {-algebra} \\ &{}&{}&{}\hbox {homomorphisms induced by the maps} fu,gu : A\rightarrow C \\ &{}&{}&{}\hbox {respectively, and since these two maps coincide by the}\\ &{}&{}&{}\hbox {assumptions on} f,g)\\ &{}=&{}d&{}\hbox {(by similar arguments).} \end{array} \end{aligned}$$

      That is, every \(T\)-algebra has at most one element, making \(T\) a trivial monad. This excludes Case 2.

3 Some cartesian cases and a related negative result

Following Carboni and Johnstone [2], Leinster [19] defines a finitary algebraic theory to be strongly regular if it can be presented with identities in which “the same variables appear in the same order, without repetition, on each side”. In our notation (2.7) this means that \(\Phi \) consists of such identities, and if so, then, motivated by Theorem C.1.1 in [19], we could call \(\mathrm{\mathbf C}\) operadic. As we know from [2] (see also [19] and [3]), the corresponding monad is then cartesian, which, among other things, obviously implies the following well-known observation:

Observation 3.1

The monads determining the following varieties are cartesian:

  1. (a)

    All varieties of the form \(\mathrm{Alg}(\Omega ,\Phi )\) with \(\Phi =\emptyset \). In particular, all varieties in which \(\Omega =\Omega _0\), that is, each basic operation is \(0\)-ary, can be presented this way. This includes sets and pointed sets.

  2. (b)

    All varieties \(\mathrm{\mathbf C}\) of the form \(\mathrm{\mathbf C}=\mathrm{\mathbf Set}^M\), where \(M\) is a monoid. In fact, more generally, the same is obviously true for all varieties of the form \((C\downarrow \mathrm{\mathbf Set}^M)\), where \(C\) is any \(M\)-set.

  3. (c)

    The variety of semigroups.

  4. (f)

    The variety of monoids.

We should also mention an analogy with Manes’ Theorem 4.14 of [20] and Proposition 3.11 of [21], which say that a finitary monad is taut if and only if the corresponding algebraic theory is balanced. Here “balanced”, also called “regular” by some other authors, is a notion weaker than what we referred to as “strongly regular”: it only requires having the same set of variables on each side. This fact is very important for us since all easy consequences of our results involving taut monads can also be obtained from it.

In spite of Observation 3.1(b), there are varieties of \(M\)-sets that are not even weakly cartesian. This follows from Theorem 3.3 below, whose formulation and proof require to adjust the universal-algebraic and the monad-theoretic terminology and notation. For our purposes it actually suffices to mention that:

  • The universal-algebraic expression \(t = t(x)\), for a unary term \(t\), simply means that \(t\) belongs to \(T(\{x\})\). And if \(t\) belongs to \(T(\{x\})\), \((A,h_A)\) is a \(T\)-algebra, and \(a\) is an element in \(A\), then \(t(a) = h_AT(\underline{a})(t)\) is the image of \(t\) under the \(T\)-algebra homomorphism \((T(\{x\}),\mu _{\{x\}})\rightarrow (A,h_A)\), induced by the map \(\underline{a} :\{x\}\rightarrow A\) sending \(x\) to \(a\). In particular, if \(t'\in T(\{x\})\) is another such unary term, then \(t(t') =\mu _{\{x\}}T(\underline{t'})(t)\), that is, \(t(t')\) is the image of \(t\) under the \(T\)-algebra homomorphism \((T(\{x\}),\mu _{\{x\}})\rightarrow (T(\{x\}),\mu _{\{x\}})\), induced by the map \(\underline{t'}: \{x\}\rightarrow T(\{x\})\) sending \(x\) to \(t'\). Furthermore, \(t' =\eta _{\{x\}}(x)\) makes \(\mu _{\{x\}}T(\underline{t'})(t) =\mu _{\{x\}}T(\eta _{\{x\}})(t) = t\), and so \(t=t(\eta _{\{x\}}(x))\); this nicely agrees with the universal-algebraic expression \(t = t(x)\) up to identifying \(x\) with its image in \(T(\{x\})\).

  • In the notation above, if \(u : A\rightarrow B\) is a \(T\)-algebra homomorphism, then

    $$\begin{aligned} u(t(a)) = t(u(a)). \end{aligned}$$
    (3.1)

    Indeed, \(u(t(a)) = uh_AT(\underline{a})(t) = h_BT(u)T(\underline{a})(t) = h_BT(u(\underline{a}))(t) = h_BT(\underline{u(a)})(t) = t(u(a))\).

  • When \(A\) is equipped with a \(T\)-algebra structure \(h_A\), the (well-defined) element \(t(a)\) of \(A\) should not be confused with the element \(t(\eta _A(a))\) of \(T(A)\). These elements are only related by

    $$\begin{aligned} t(a) = h_A(t(\eta _A(a))). \end{aligned}$$
    (3.2)

    since

    $$\begin{aligned} t(\eta _A(a)) = T(\underline{a})(t), \end{aligned}$$
    (3.3)

    which is easy to check:

    $$\begin{aligned} t(\eta _A(a)) =\mu _AT(\underline{\eta _A(a)})(t) =\mu _AT(\eta _A\underline{a})(t) =\mu _AT(\eta _A)T(\underline{a})(t) = T(\underline{a})(t). \end{aligned}$$
  • By a pseudo-constant unary term we mean a term \(t\in T(\{x\})\) such that, for every \(T\)-algebra \(A\) and every two elements \(a\) and \(a'\) in \(A\), we have \(t(a) = t(a')\) in \(A\). In the universal-algebraic terminology, a unary term \(t\) is pseudo-constant if and only if, for some distinct variables \(x\) and \(y\), the identity \(t(x) = t(y)\) holds in \(\mathrm{\mathbf C}\). Note that whenever \(T(\emptyset )\) is non-empty, or, equivalently, \(\Omega _0\) is nonempty, every pseudo-constant term \(t\in T(\{x\})\) is constant, that is, it belongs to the image of \(T(\emptyset )\) in \(T(\{x\})\). The notion of pseudo-constant term has an obvious non-unary version of course.

The following lemma would be considered trivial in universal algebra; nevertheless we shall prove it since it plays an important role in the proof of Theorem 3.3(b) below.

Lemma 3.2

Let \(S\) be a set and \(s\) and \(s'\) elements in \(S\). If \(s\ne s'\) and \(t\) is not a pseudo-constant term, then \(t(\eta _A(s))\ne t(\eta _A(s'))\).

Proof

Let \((A,h_A)\) be a \(T\)-algebra and \(a\) and \(a'\) elements in \(A\). Let \(f : S\rightarrow A\) be any map with \(f(s) = a\) and \(f(s') = a'\), and \(g : (T(S),\mu _S)\rightarrow (A,h_A)\) be a \(T\)-algebra homomorphism such that \(g\eta _S = f\); the existence of such \(g\) follows from the fact that \(\eta _S\) makes \((T(S),\mu _S)\) the free \(T\)-algebra on \(S\). Using (3.1) (for \(g\) playing the role of \(u\)), we obtain

$$\begin{aligned} t(a) = t(f(s)) = t(g(\eta _S(s))) = g(t(\eta _A(s))), \end{aligned}$$

and similarly \(t(a') = g(t(\eta _A(s')))\). Therefore the equality \(t(\eta _A(s)) = t(\eta _A(s'))\) would imply that \(t\) is a pseudo-constant term. \(\square \)

Theorem 3.3

  1. (a)

    If \(\mathrm{\mathbf C}\) admits a pseudo-constant term that is not constant, then the functor \(T\) is not weakly cartesian (and not even taut).

  2. (b)

    Let \(\mathrm{\mathbf C}= \mathrm{\mathbf Set}^T = \mathrm{Alg}(\Omega ,\Phi )\) be a variety of universal algebras that admits non-pseudo-constant unary terms \(t, t'\in T(\{x\})\), for which \(t(t')\) is a pseudo-constant term. Then the natural transformation \(\mu \) is not weakly cartesian (and not even taut).

Proof

(a): Let \(t\in T(\{x\})\) be a pseudo-constant non-constant term, and let us choose in the pullback diagram (2.1): \(X = \{x\}\), \(Y = \{y\}\), \(Z = \{x,y\}\), with \(f\) and \(g\) the inclusion maps. Then \(\langle T(p),T(q)\rangle : T(W)\rightarrow T(X)\times _{T(Z)}T(Y)\) cannot be surjective since:

  • \(T(W) = T(\emptyset )\) is empty – otherwise there would be no pseudo-constant non-constant terms;

  • \(T(X)\times _{T(Z)}T(Y)\) is non-empty since it contains the element determined by the pair \((t(\eta _X(x)), t(\eta _Y(y)))\).

(b): Let \(u : A\rightarrow B\) by any map of sets with non-empty \(A\) and the cardinality of \(B\) strictly larger than the cardinality of \(T^2(A)\), and \(a\) any fixed element in \(A\). For each \(b\in B\), using (3.2), then (3.3), we obtain

$$\begin{aligned} \begin{array}{l@{\quad }l@{\quad }l@{\quad }l} \mu _B(t(\eta _{T(B)}(t'(\eta _B(b))))) &{}=&{} t(t'(\eta _B(b)))&{} (\hbox {by} (3.2))\\ &{}=&{} t(T(\underline{b})(t'))&{} (\hbox {by} (3.3))\\ &{}=&{} T(\underline{b})(t(t')) &{}(\hbox {by} (3.1))\\ &{}=&{} (t(t'))(\eta _B(b))&{} (\hbox {by} (3.3))\\ &{}=&{} (t(t'))(T(u)(\eta _A(a)))\;&{}(\hbox {since}\; t(t')\; \hbox {is a pseudo-constant term}) \end{array} \end{aligned}$$

Therefore the set \(P_a = \{((t(\eta _{T(B)}(t'(\eta _B(b)))),T(u)(\eta _A(a)))\;|\,b\in B\}\) can be considered as a subset in the pullback \(T^2(B)\times _{T(B)}T(A)\). After that, since the cardinality of \(B\) is strictly larger than the cardinality of \(T^2(A)\), it suffices to show that, for \(b, b'\in B\), we have

$$\begin{aligned} b \ne b' \;\Rightarrow \; t(\eta _{T(B)}(t'(\eta _B(b))))\ne t(\eta _{T(B)}(t'(\eta _B(b')))). \end{aligned}$$

But we have

$$\begin{aligned} b\ne b'\;\Rightarrow \; t'(\eta _B(b)) \ne t'(\eta _B(b'))\;\Rightarrow \; t(\eta _{T(B)}(t'(\eta _B(b)))) \ne t(\eta _{T(B)}(t'(\eta _B(b')))), \end{aligned}$$

where both implications follow from Lemma 3.2 since \(t'\) and \(t\) are not pseudo-constant terms. \(\square \)

The simplest example seems to be:

Example 3.4

Let \({\mathbb {N}}\) be the additive monoid of natural numbers, and \(\mathrm{\mathbf C}\) the variety of \({\mathbb {N}}\)-sets determined by the identity \(2x = 2y\). Then the corresponding monad \(T = (T,\eta ,\mu )\) can be described as follows:

  • \(T(\emptyset ) =\emptyset \);

  • If \(X\) is a non-empty set, then \(T(X)\) is the disjoint union of \(\{0,1\}\times X\) with a one-element set, say \(\{\infty \}\);

  • \(\eta _X : X\rightarrow T(X)\) is defined by \(\eta _X(x) = (0,x)\);

  • \(\mu _X : T^2(X)\rightarrow T(X)\) is defined by

    $$\begin{aligned} \begin{array}{ll}\mu _X(k,(l,x))&{}={\left\{ \begin{array}{ll} (k+l,x),&{}\hbox { if }k+l\le 1,\\ \infty ,&{}\hbox { if }k+l=2,\end{array}\right. }\\ \mu _X(k,\infty )&{}=\mu _X(\infty )=\infty .\end{array} \end{aligned}$$

This monad has a pseudo-constant non-constant term, namely \(\infty \), and so \(T\) is not weakly cartesian by Theorem 3.3(a). Furthermore, for this monad \(t = (1,x)\in T(\{x\})\) is not pseudo-constant term, while \(t(t) =\infty \) is a pseudo-constant term. Therefore, by Theorem 3.3(b), \(\mu \) is not weakly cartesian. It is easy to check, however, that \(\eta \) is.

Remark 3.5

It is easy to check that modifying Example 3.4 by only adding an arbitrary (non-zero) number of constants and still requiring only the identity \(2x = 2y\), and possibly identities involving only constants, we obtain a monad with \(T\) and \(\eta \) weakly cartesian, but, still by Theorem 3.3(b), \(\mu \) is not weakly cartesian.

4 Normal, \(0\)-regular, subtractive, and weakly subtractive varieties

The following conditions on a variety \(\mathrm{\mathbf C}\) of universal algebras are well known to be equivalent:

  • \(\mathrm{\mathbf C}\) is pointed as a category, that is, each hom-set \(\hom _C(A,B)\) has a distinguished element \(0_{A,B}\), such that, for every morphism \(\alpha :A'\rightarrow A\) and every morphism \(\beta :B\rightarrow B'\), we have \(\beta 0_{A,B}\alpha = 0_{A',B'}\);

  • the free algebra in \(\mathrm{\mathbf C}\) on the empty set (=the initial object in \(\mathrm{\mathbf C}\)) has exactly one element (=is isomorphic to the terminal object in \(\mathrm{\mathbf C}\));

  • the set \(\Omega _0\) of \(0\)-ary operators in \(\mathrm{\mathbf C}\) is non-empty, and every two constant terms are equal to each other in each object in \(\mathrm{\mathbf C}\);

  • every algebra in \(\mathrm{\mathbf C}\) has a one-element subalgebra;

  • every algebra in \(\mathrm{\mathbf C}\) has a unique one-element subalgebra, and, for every morphism \(\alpha :A'\rightarrow A\), that subalgebra is the image of \(0_{A',A}\).

When these conditions hold, we shall simply say that \(\mathrm{\mathbf C}\) is pointed, and we shall use the symbol \(0\) for

  • each of the morphisms \(0_{A,B}\);

  • the initial object in \(\mathrm{\mathbf C}\);

  • the unique subalgebra in any algebra in \(\mathrm{\mathbf C}\);

  • the unique element in that subalgebra;

we will also say that \(0\) is the unique constant (=\(0\)-ary basic operation) in \(\mathrm{\mathbf C}\).

When \(\mathrm{\mathbf C}\) is pointed, every morphism \(u:A\rightarrow B\) has a kernel

which can be described, up to isomorphism, as the inclusion map of \(\{a\in A\;|\;u(a) = 0\}\) into \(A\), and a cokernel

which can be described, up to an isomorphism, as the canonical morphism from \(B\) to its quotient by the smallest congruence \(\sim \), under which \(u(a)\sim 0\) for each \(a\in A\). The surjective homomorphisms (=regular epimorphisms) that occur as cokernels are called normal epimorphisms. Furthermore, following [16], we shall say that \(\mathrm{\mathbf C}\) is normal if every regular epimorphism in \(\mathrm{\mathbf C}\) is a normal epimorphism. It is well known that every pointed BIT variety in the sense of [25] (=ideal determined varieties in the sense of [11]) is normal. In particular this includes all semi-abelian varieties (see [13], although in the case of pointed varieties semi-abelianness is equivalent to Bourn protomodularity [1]), and (therefore, although known independently) all varieties of groups with multiple operators in the sense of [12], and so many varieties of classical algebra, e.g. of groups, non-unital rings, modules and various kinds of non-unital algebras over rings, etc. It also includes, say, Heyting semilattices (see e.g. [17]; note, however, that the role of \(0\) there is played by the constant \(1\)), but not, for example, semilattices, lattices, monoids, etc.

Slightly more generally, instead of normal varieties we could consider what universal algebraists call \(0\)-regular varieties (see [1416] for explanations and references). Those are not necessarily pointed, but have a distinguished constant \(0\) such that every congruence on any given algebra is completely determined by its \(0\)-class. An obvious reason to consider the non-pointed case is to include important examples, such as unital rings and unital algebras over rings, Boolean and Heyting algebras, etc.

Another class of varieties containing the above-mentioned varieties of classical algebra is the class of subtractive varieties in the sense of Ursini [26] (in fact appeared already in [11]). They are defined as varieties having a distinguished constant \(0\) and a binary term \(s\) with the identities \(s(x,x) = 0\) and \(s(0,x) = x\) (among others). As shown in [11], a (not necessarily pointed) variety is ideal determined if and only if it is \(0\)-regular and subtractive at the same time, with the same \(0\). Writing briefly,

$$\begin{aligned} \hbox {ideal determined } = 0\hbox {-regular }\;\cap \;\hbox { subtractive, } \end{aligned}$$
(4.1)

and we are going to introduce weakly subtractive varieties with

$$\begin{aligned} \hbox {weakly subtractive }\supset \; 0\hbox {-regular }\;\cup \;\hbox { subtractive, } \end{aligned}$$
(4.2)

as follows:

Definition 4.1

A variety \(\mathrm{\mathbf C}\) is said to be weakly subtractive if it admits a constant \(0\) and a non-pseudo-constant \(n\)-ary term \(s\) satisfying the identity \(s(x,\ldots ,x) = 0\) and not satisfying the identity \(s(x_1,\ldots ,x_n) = 0\).

Observation 4.2

  1. (a)

    Every subtractive variety is weakly subtractive indeed, as trivially follows from the definitions.

  2. (b)

    Every \(0\)-regular variety is also weakly subtractive as follows, e.g., from Corollary 1.7 in [11].

  3. (c)

    For the subtractive and \(0\)-regular varieties the desired term \(s\) there is actually binary. On the other hand, in Definition 4.1, the cases \(n= 0\) and \(n = 1\) are automatically excluded, since in those cases our requirements on \(s\) become contradictory.

For a morphism \(u : A\rightarrow B\) in a variety \(\mathrm{\mathbf C}\) having a distinguished constant \(0\), we shall still write \(\mathrm{Ker}(u) = \{a\in A\;|\;u(a) = 0\}\), even though this \(\mathrm{Ker}(u)\) does not have to be a subalgebra in \(A\) in general. The following simple observation characterizes weak subtractivity of \(\mathrm{\mathbf C}\) in terms of the corresponding monad on \(\mathrm{\mathbf Set}\):

Observation 4.3

Let \(\mathrm{\mathbf C}\) be a variety having a distinguished constant \(0\) and \(T = (T,\eta ,\mu )\) be the corresponding monad on \(\mathrm{\mathbf Set}\). The following conditions are equivalent:

  1. (i)

    \(\mathrm{\mathbf C}\) is weakly subtractive;

  2. (ii)

    there exists a map \(f : X\rightarrow Y\) with \(\mathrm{Ker}(T(f))\ne 0\);

  3. (iii)

    for each cardinal number \({\mathfrak {m}}\) there exists a map \(f : X\rightarrow Y\) with \(\mathrm{Ker}(T(f))\) having cardinality \(\ge \mathfrak {m}\).

Let us only explain the implication (ii) \(\Rightarrow \) (iii), which might seem less obvious. Condition (ii) means that the variety \(\mathrm{\mathbf C}\) admits a, say \(n\)-ary, term \(t\) with \(t(x_1,\ldots ,x_n)\ne 0\) in \(T(X)\) and \(t(f(x_1),\ldots , f(x_n)) = 0\) in \(Y\), for some \(f : X \rightarrow Y\), and some elements \(x_1,\ldots , x_n\) in \(X\). Let us use:

  • a set \(S\) of cardinality \(\ge \mathfrak {m}\);

  • the map \(S\times f : S\times X \rightarrow S\times Y\) defined by \((S\times f)(s,x) = (s,f(x))\).

We have:

  1. (a)

    \(t(0,\ldots ,0) = 0\) in any \(T\)-algebra \(A\). Indeed, since \(T(Y)\) is the free \(T\)-algebra on \(Y\), the map \(u :Y \rightarrow T(A)\), sending all elements of \(Y\) to \(0\), extends to a homomorphism \(v : T(Y) \rightarrow T(A)\), and we have

    $$\begin{aligned} t(0,\ldots ,0)&= t(uf(x_1),\ldots ,uf(x_n)) = t(vf(x_1),\ldots ,vf(x_n)) \\&= v(t(f(x_1),\ldots , f(x_n))) = v(0) = 0. \end{aligned}$$
  2. (b)

    \(t((S\times f)(s,x_1),\ldots , (S\times f)(s,x_n)) = 0\) in \(T(S\times X)\) for every \(s\) in \(S\). Indeed, let \(w : Y \rightarrow S\times Y\) be defined by \(w(y) = (s,y)\)—then

    $$\begin{aligned} t((S\times f)(s,x_1),\ldots , (S\times f)(s,x_n))&= T(w)(t(f(x_1),\ldots , f(x_n))) \\&= T(w)(0) = 0, \end{aligned}$$

    since \(T(w)\) is a homomorphism of algebras.

  3. (c)

    As follows from (b), \(t((s,x_1),\ldots ,(s,x_n))\) is in \(\mathrm{Ker}(T(S\times f))\) for each \(s\) in \(S\), after which it suffices to show that

    $$\begin{aligned} s \ne s' \implies t((s,x_1),\ldots ,(s,x_n)) \ne t((s',x_1),\ldots ,(s',x_n)). \end{aligned}$$

    For, given \(s \ne s'\) in \(S\), consider any map \(g : S\times X \rightarrow X\) with \(g(s,x) = x\) and \(g(s',x) = 0\), for each \(x\) in \(X\). We have

    $$\begin{aligned} \begin{array}{{l}} T(g)(t((s,x_1),\ldots ,(s,x_n))) = t(x_1,\ldots ,x_n) \ne 0,\hbox { and}\\ T(g)(t((s',x_1),\ldots ,(s',x_n))) = t(0,\ldots ,0) = 0 \hbox { by (a)}.\end{array} \end{aligned}$$

Therefore \(t((s,x_1),\ldots ,(s,x_n)) \ne t((s',x_1),\ldots ,(s',x_n))\), as desired.

And this observation helps to prove:

Proposition 4.4

If \(\mathrm{\mathbf C}\) is weakly subtractive, then \(T\) is not a weakly cartesian functor (not even taut).

Proof

Let \(f : X\rightarrow Y\) be as in 4.3(iii) with \(\mathfrak {m}\) larger than the cardinality of \(T(\emptyset )\). Consider the pullback

(4.3)

The resulting canonical map \(T(\emptyset )\rightarrow T(X)\times _{T(Y)}T(\emptyset )\) cannot be surjective since \(T(X)\times _{T(Y)}T(\emptyset )\) contains the set \(\mathrm{Ker}(T(f))\times \{0\}\), whose cardinality is larger than the cardinality of \(T(\emptyset )\). \(\square \)

5 A condition that excludes cartesianness

Unlike Sect. 2, due to the simplicity of the data considered in this section, we shall freely use the universal-algebraic expressions \(t = t(x), t(y)\) instead of \(t(\eta _Y(y))\), etc. The section is devoted to a simple result showing that many familiar monads are not cartesian:

Proposition 5.1

Suppose \(\mathrm{\mathbf C}\) admits a commutative binary operation which is not pseudo-constant, that is, admits a binary term \(t\) satisfying \(t(x,y) = t(y,x)\) and not \(t(x,x') = t(y,y')\), where \(x, x', y, y'\) are pairwise distinct variables. Then the functor \(T\) does not preserve pullbacks.

Proof

Let us choose the pullback (2.1) using pairwise distinct elements \(x, x', y, y'\) as follows:

  • \(X\) is any set containing \(x\) and \(x'\);

  • \(Y\) is any set containing \(y\) and \(y'\);

  • \(Z\) is a one-element set (which uniquely determines \(f\) and \(g\)).

Then \(W = X\times Y\), and so \(T(W)\) contains the elements \(t((x,y),(x',y'))\) and \(t((x,y'),(x',y))\); moreover \(t((x,y),(x',y'))\ne t((x,y'),(x',y))\) since the elements \((x,y), (x',y'), (x,y'), (x',y)\) are pairwise distinct. On the other hand,

$$\begin{aligned} \begin{array}{lll}\langle T(p),T(q)\rangle (t((x,y),(x',y'))) &{}=&{} (t(x,x'),t(y,y')) = (t(x,x'),t(y',y))\\ &{}=&{} \langle T(p),T(q)\rangle (t((x,y'),(x',y))).\end{array} \end{aligned}$$

That is, \(\langle T(p),T(q)\rangle \) is not injective. \(\square \)

Marek Zawadowski told us that this proposition also follows from results of [23].

6 Semimodules I: preliminary remarks and weak cartesianness of the functor \(T\)

From now on, \(\mathrm{\mathbf C}\) will denote the variety \(S\)-\(\mathrm{\mathbf SMod}\) of semimodules over a semiring \(S\). This and the next two sections will be devoted to the analysis of weak cartesianness of the functor \(T\) and of the natural transformations \(\eta \), \(\mu \), respectively, for the corresponding (free \(S\)-semimodule) monad \(T = (T,\eta ,\mu )\).

Let us recall that an \(S\)-semimodule is a commutative monoid \(A\), written additively, and equipped with a scalar multiplication \(S\times A\rightarrow A\), written as \((s,a)\mapsto sa\), and satisfying the identities

$$\begin{aligned} 1a = a,\; s(ta) = (st)a,\; 0a = 0,\; (s+t)a = sa\!+\!ta,\; s0 = 0,\; s(a\!+\!b) = sa\!+\!sb,\nonumber \\ \end{aligned}$$
(6.1)

in obvious notation. Let us also mention that \(S\)-\(\mathrm{\mathbf SMod}\) is enriched in the category of commutative monoids, and its biproducts

(6.2)

are the usual cartesian product \(A\times B\) with the product projections \(\pi _1\) and \(\pi _2\), and \(\iota _1\) and \(\iota _2\) defined by \(\iota _1(a) = (a,0)\) and \(\iota _2(b) = (0,b)\) respectively. This coincidence of binary products and coproducts implies the preservation of arbitrary coproducts by the functor \(T\), and, either using this fact or directly, the monad \(T = (T,\eta ,\mu )\) can be described as follows:

  • for any set \(X\), \(T(X)\) is the set of all maps \(t : X\rightarrow S\) for which the set \(\{x\in X\;|\;t(x)\ne 0\}\) is finite;

  • for a map \(f : X\rightarrow Y\), we have \(T(f)(t)(y) = \sum _{f(x)=y}t(x)\);

  • \(\eta _X : X\rightarrow T(X)\) is the map that has \(\eta _X(x)(x) = 1\) and \(\eta _X(x)(y) = 0\) for \(x\ne y\);

  • \(\mu _X : T^2(X)\rightarrow T(X)\) is the map defined by \(\mu _X(u)(x) = \sum _{t\in T(X)}u(t)t(x)\), where all possibly infinite sums make sense since they involve only finitely many non-zero summands.

We will analyze weak cartesianness of the functor \(T\) in several steps.

Lemma 6.1

The following conditions are equivalent:

  1. (i)

    the \(T\)-image of (2.1) is a weak pullback whenever \(Y =\emptyset \);

  2. (ii)

    the \(T\)-image of (2.1) is a weak pullback whenever the numbers of elements in \(X\), \(Y\), and \(Z\) are \(2\), \(0\), and \(1\) respectively;

  3. (iii)

    no non-zero element in \(S\) has an additive inverse.

Proof

If \(Y =\emptyset \), then \(W =\emptyset \) and \(T(W) = T(Y) = \{0\}\), and so the \(T\)-image of (2.1) is a weak pullback if and only if the kernel of \(T(f) : T(X)\rightarrow T(Z)\) is \(\{0\}\). And when \(X\) and \(Z\) are as in (ii), the map \(T(f) : T(X)\rightarrow T(Z)\) can be identified with the addition map \(+: S\times S\rightarrow S\), which makes the equivalence (ii) \(\Leftrightarrow \) (iii) obvious. Since the implication (i) \(\Rightarrow \) (ii) is trivial, it only remains to prove the implication (iii) \(\Rightarrow \) (i).

Suppose \(\alpha :X\rightarrow S\) belongs to the kernel of \(T(f)\). Then, for every \(z\) in \(Z\), we have \(\sum _{f(x)=z}\alpha (x) = 0\), and, as follows from (iii), this means that \(\alpha (x) = 0\) for every \(x\) in \(X\). That is, \(\alpha =0\) in \(T(X)\). \(\square \)

Lemma 6.2

For the weak pullback diagrams

(6.3)

where \(+\) denotes the coproduct (=disjoint union) of sets, if \(T\) preserves the left-hand weak pullbacks, then it also preserves the right-hand weak pullback.

Proof

Just observe that the free \(S\)-semimodule functor preserves coproducts and that finite coproducts of \(S\)-semimodules are canonically isomorphic to products. \(\square \)

Lemma 6.3

The following conditions are equivalent:

  1. (i)

    the \(T\)-image of (2.1) is a weak pullback whenever \(f\) and \(g\) are surjective;

  2. (ii)

    the \(T\)-image of (2.1) is a weak pullback whenever the numbers of elements in \(X\), \(Y\) and \(Z\) are \(2\), \(2\) and \(1\) respectively.

  3. (iii)

    for every \(a, b, c, d\) in \(S\) with \(a+b = c+d\), there exist \(x, y, z, t\) in \(S\) with

    $$\begin{aligned} x+y = a,\; z+t = b,\; x+z = c,\; y+t = d. \end{aligned}$$
    (6.4)
  4. (iv)

    for every natural \(m\) and \(n\), and \(s_1,\ldots , s_m\) and \(t_1,\ldots , t_n\) in \(S\) with \(s_1+\cdots +s_m = t_1+\cdots +t_n\), there exists a \(\{1,\ldots ,m\}\,\times \, \{1,\ldots ,n\}\)-indexed family \((x_{ij})\) of elements in \(S\) with

    $$\begin{aligned}&x_{i1}+\cdots +x_{in} = s_i\quad \hbox { for all } i = 1,\ldots ,m\quad \hbox { and }\nonumber \\&\quad x_{1j}+\cdots +x_{mj}= t_j \quad \hbox { for all } j = 1,\ldots ,n. \end{aligned}$$
    (6.5)

Proof

When \(X,\, Y\), and \(Z\) are as in (ii), the \(T\)-image of (2.1) can be presented as

(6.6)

where \(u\) and \(v\) are defined by \(u(x,y,z,t) = (x+y,z+t)\) and \(v(x,y,z,t) = (x+z,y+t)\) respectively, which makes the equivalence (ii) \(\Leftrightarrow \) (iii) obvious. Since the implication (i) \(\Rightarrow \) (ii) is trivial, it only remains to prove the implications (iii) \(\Rightarrow \) (iv) and (iv) \(\Rightarrow \) (i).

(iii) \(\Rightarrow \) (iv) is an elementary exercise. We can use induction in \(m\) and in \(n\), and since \(m\) and \(n\) play “symmetric roles”, we can assume that there exist an \(\{1,\ldots ,m\}\times \{2,\ldots ,n\}\)-indexed family \((y_{ij})\) of elements in \(S\) with

$$\begin{aligned}&y_{i2}+\cdots +y_{in} = s_i\quad \hbox { for all } i = 1,\ldots ,m, \hbox { and }\\&\quad y_{1j}+\cdots +y_{mj}= u_j \quad \hbox { for all } j = 2,\ldots ,n, \end{aligned}$$

where \(u_2 = t_1+t_2\) and \(u_j = t_j\) for \(j = 3,\ldots ,n\). After that we can take \(x_{ij} = y_{ij}\) for \(j = 3,\ldots ,n\), and define \(x_{ij}\) for \(j = 1, 2\) in such a way that

$$\begin{aligned} x_{i1}+x_{i2} = y_{i2}\quad \hbox { for all }i = 1,\ldots ,m,\hbox { and }x_{1j}+\cdots +x_{mj} = t_j\quad \hbox { for all }j = 1, 2. \end{aligned}$$

This, using induction in \(m\), reduces to (iii).

(iv) \(\Rightarrow \) (i): Since the functor \(T\) preserves filtered colimits, it suffices to consider the case of finite \(X\), \(Y\), and \(Z\). Moreover, thanks to Lemma 6.2, we can assume that \(Z\) is a one-element set. But assuming \(Z\) in (i) to be a one-element set (and \(X\ne \emptyset \ne Y\)) makes (i) obviously equivalent to (iv), just like (ii) is equivalent to (iii). \(\square \)

Theorem 6.4

The following conditions are equivalent:

  1. (i)

    \(T\), as a functor, is weakly cartesian;

  2. (ii)

    \(S\) satisfies the conditions of Lemmas 6.1 and 6.3.

Proof

(i) \(\Rightarrow \) (ii) follows from Lemmas 6.1 and 6.3.

(ii) \(\Rightarrow \) (i): Assuming that conditions 6.1(iii) and 6.3(iii) hold, we have to show that \(T\)-images of arbitrary pullbacks (2.1) are weak pullbacks. Using filtered colimits again this reduces to the case of pullbacks of finite sets, and then, using Lemma 6.2, we can assume that \(Z\) in (2.1) is a one-element set. After this we just apply Lemmas 6.1 and 6.3. \(\square \)

Remark 6.5

Using similar arguments it is easy to show that the functor \(T\) is taut if and only if no non-zero element in \(S\) has an additive inverse.

7 Semimodules II: weak cartesianness for \(\eta \)

Theorem 7.1

The following conditions are equivalent:

  1. (i)

    \(\eta \) is weakly cartesian;

  2. (ii)

    If \(s+s' = 1\) in \(S\), then either \(s = 0\), or \(s' = 0\). In particular no non-zero element of \(S\) has an additive inverse.

Proof

(i) \(\Rightarrow \) (ii): When \(A\) is a two-element set, say, \(A = \{a,a'\}\), and \(B\) is a one-element set, say, \(B = \{b\}\), diagram (2.3) becomes

(7.1)

and, obviously, diagram (7.1) is a (weak) pullback if and only if condition (ii) holds.

(ii) \(\Rightarrow \) (i): Suppose, in diagram (2.3), we have \(\eta _B(b) = T(u)(t)\) for some \(b\in B\) and some \(t : A\rightarrow S\) in \(T(A)\). Then we have:

$$\begin{aligned} \sum _{u(a)=b}t(a) = 1 \hbox { and } \sum _{u(a)=b'}t(a) = 0\quad \hbox { for } b'\ne b \hbox { in }B. \end{aligned}$$
(7.2)

But, under condition (ii), (7.2) is equivalent to the existence of (a unique) \(a\in A\) with

$$\begin{aligned} t(a) = 1 \hbox { and } \forall x\in A\;\; (x \ne a \Rightarrow t(a) = 0), \end{aligned}$$
(7.3)

and (7.3) implies that the image of \(a\) under the canonical map \(A \rightarrow T(A)\times _{T(B)}B\) is the element determined by the pair \((b,t)\). This proves (i). \(\square \)

8 Semimodules III: weak cartesianness of \(\mu \) and the main results

For \(S\) and \(T = (T,\eta ,\mu )\) as in the two previous sections, the analysis of weak cartesianness for \(\mu \), which this section is devoted to, is rather complicated and we will use weak cartesianness for the functor \(T\) in it. The first of our auxiliary results is obvious and surely known to many readers:

Lemma 8.1

Let

(8.1)

be a commutative diagram in the category of sets. Then:

  1. (a)

    if \(\fbox {1}\) and \(\fbox {2}\) are weak pullbacks, then so is the rectangle \(\fbox {1}\fbox {2}\);

  2. (b)

    if \(\fbox {1}\fbox {2}\) is a weak pullback and are jointly monic, then \(\fbox {1}\) is a weak pullback;

  3. (c)

    if \(\fbox {1}\fbox {2}\) is a weak pullback and the map \(X\rightarrow Y\) is surjective, then \(\fbox {2}\) is a weak pullback.

Furthermore:

  1. (d)

    the class of weak pullback diagrams of sets is closed under products in the category of all square diagrams in the category of sets;

  2. (e)

    for any weak pullback diagram (2.1) and any map \(h : X\rightarrow E\), the diagram

    (8.2)

    is a weak pullback.

Lemma 8.2

The class \({\mathcal {U}}\) of all maps \(u : A\rightarrow B\) for which diagram (2.4) is a weak pullback has the following properties:

  1. (a)

    it contains all bijections and is closed under composition and filtered colimits;

  2. (b)

    if \(T\) is a weakly cartesian functor, then \({\mathcal {U}}\) is closed under coproducts with bijections.

Proof

(a): The fact that \({\mathcal {U}}\) contains all bijections is obvious. Closedness under composition easily follows from Lemma 8.1(a). Closedness under filtered colimits follows from the fact that the functor \(T\) preserves filtered colimits.

(b): We have to prove that, if the functor \(T\) is weakly cartesian and diagram (2.4) is a weak pullback, then, for any set \(E\), the diagram

(8.3)

also is a weak pullback; here \(+\) stands for coproducts in the category of sets, that is, for disjoints unions (and induced maps between disjoint unions). For that, consider the commutative diagram

(8.4)

in which the unlabeled arrows are canonical bijections, and observe:

  • the square \(\fbox {1}\) is a weak pullback, which follows from weak cartesianness of the functor \(T\) and Lemma 8.1(e);

  • the square \(\fbox {2}\) is a weak pullback, which follows from the fact that so is diagram (2.4) and Lemma 8.1(d);

  • therefore \(\fbox {1}\fbox {2}\) is a weak pullback by Lemma 8.1(a);

  • Since the two top and two bottom vertical arrows in (8.4) are bijections, and \(\fbox {1}\fbox {2}\) is a weak pullback, diagram (8.3) also is a weak pullback. \(\square \)

Corollary 8.3

If \(T\) is a weakly cartesian functor, then the natural transformation \(\mu \) is weakly cartesian if diagram (2.4) is a weak pullback whenever \(A\) is either empty or is a two-element set, and \(B\) is a one-element set.

Corollary 8.4

If \(T\) is a taut functor, then the natural transformation \(\mu \) is taut if diagram (2.4) is a (weak) pullback whenever \(A\) is empty and \(B\) is a one-element set.

Observation 8.5

  1. (a)

    Let \(A\) and \(B\) be the empty and a one-element set, respectively. Then diagram (2.4) can be described as

    (8.5)

    where \(T(0)(s)(0) = s,\, T(0)(s)(s') = 0\) for \(s'\ne 0\), and \(\mu _1\) is defined by \(\mu _1(t) =\sum _{s\in S} t(s)s\).

  2. (b)

    Let \(A\) and \(B\) be a two-element set and a one-element set, respectively (as in the proof of Theorem 7.1). Then diagram (2.4) can be described as

    (8.6)

    where \(\mu _1\) is as above, and \(T(+)\) and \(\mu _2\) are defined by \(T(+)(t)(s) =\sum _{x+y=s} t(x,y)\) and

    $$\begin{aligned} \mu _2(t) = \left( \sum _{(x,y)\in S\times S} t(x,y)x,\sum _{(x,y)\in S\times S} t(x,y)y\right) , \end{aligned}$$

    respectively.

Lemma 8.6

If \(A\) and \(B\) are the empty and a one-element set, respectively, then diagram (2.4) is a weak pullback if and only if, for \(s, s'\in S\), we have:

  1. (a)

    \(s+ s' = 0 \;\Rightarrow \; s = 0 = s'\) (condition 6.1(iii));

  2. (b)

    \(ss' = 0 \;\Rightarrow \; (s = 0\,\vee \,s' = 0)\).

Proof

As follows from Observation 8.5(a), when \(A\) and \(B\) are the empty and a one-element set, respectively, diagram (2.4) is a weak pullback if and only if

$$\begin{aligned} \forall t\in T(S)\;(\mu _1(t) = 0 \;\Rightarrow \; \exists s\in S \;\;T(0)(s) = t). \end{aligned}$$
(8.7)

Using the descriptions of \(\mu _1\) and \(T(0)\) in Observation 8.5(a), we can rewrite (8.7) as

$$\begin{aligned} \forall t\in T(S)\; \left( \sum _{s\in S} t(s)s = 0 \;\Rightarrow \; (s \ne 0 \;\Rightarrow \; t(s) = 0)\right) . \end{aligned}$$
(8.8)

After that we only need to prove that (8.8) is equivalent to the conjunction of conditions (a) and (b), which is straightforward:

Indeed, if (a) holds, then \(\sum _{s\in S} t(s)s = 0\) implies

$$\begin{aligned} \forall s\in S\;\; t(s)s = 0, \end{aligned}$$

which, by (b), implies \(s \ne 0 \;\Rightarrow \; t(s) = 0\). That is, (8.8) follows from the conjunction of conditions (a) and (b). On the other hand, if \(s+s' = 0\), then \(t\) in \(T(S)\), defined by

$$\begin{aligned} t(s) = 1,\; t(1) = s', \hbox { and } t(s'') = 0\quad \hbox { for }s \ne s'' \ne 1, \end{aligned}$$
(8.9)

has \(\sum _{s\in S} t(s)s = 1s + s'1 = s + s' = 0\), which, by (8.8), gives \(s' = 0\), and then \(s = s + s' = 0\). Therefore (8.8) implies (a). If \(ss' = 0\), then \(t\in T(S)\), defined by

$$\begin{aligned} t(s') = s \hbox { and } t(s'') = 0\quad \hbox { for } s'' \ne s', \end{aligned}$$
(8.10)

has \(\sum _{s\in S} t(s)s = ss' = 0\), which, by (8.8), gives \(s' \ne 0 \;\Rightarrow \; s = 0\). \(\square \)

Combining this lemma with Corollaries 6.5 and 8.4 we obtain the following characterization of taut free \(S\)-semimodule monads:

Theorem 8.7

The free \(S\)-semimodule monad \(T\) is taut if and only if no non-zero element in \(S\) has an additive inverse and no non-zero element of \(S\) is a zero divisor.

Lemma 8.8

In the notation of (8.6), let

$$ \begin{aligned} M(S)&= \{m\in T(S)\;|\; \forall a,b\in S\; (\mu _1(m) = a + b \;\Rightarrow \;\exists t\in T^2(S) \; (T(+)(t) \\&= m \; \& \; \mu _2(t) = (a,b)))\}. \end{aligned}$$

If \(T\) is a weakly cartesian functor, then \(M(S)\) is a submonoid in the additive monoid \(T(S)\).

Proof

If \(m = 0\) and \(\mu _1(m) = a + b\), then \(a = 0 = b\) by Lemma 6.1(iii). Therefore, for \(m = 0\) and \(\mu _1(m) = a + b\), we have \( T(+)(0) = m \; \& \; \mu _2(0) = (a,b)\). This implies that \(M(S)\) contains \(0\). For \(m_1, m_2 \in M(S)\) and \(a, b\in S\) with \(\mu _1(m_1+ m_2) = a + b\), we observe:

  • Since \(\mu _1(m_1)+\mu _1(m_2) = a+b\), by Lemma 6.3 there exist \(a_1, a_2, b_1, b_2\in S\) with

    $$\begin{aligned} \mu _1(m_1) = a_1+b_1,\;\mu _1(m_2) = a_2+b_2,\; a = a_1+a_2,\; b = b_1+b_2. \end{aligned}$$
  • Then, since \(m_1\) and \(m_2\) are in \(M(S)\), there exist \(t_1\) and \(t_2\) in \(T^2(S)\) with

    $$\begin{aligned} T(+)(t_1) = m_1,\;\mu _2(t_1) = (a_1,b_1),\; T(+)(t_2) = m_2,\;\mu _2(t_2) = (a_2,b_2). \end{aligned}$$
  • Therefore \(T(+)(t_1+t_2) = m_1+m_2\) and \(\mu _2(t_1+t_2) = (a_1,b_1)+(a_2,b_2) = (a,b)\), which proves that \(m_1+m_2\) is in \(M(S)\).\(\square \)

Lemma 8.9

Let \(\mathrm{\mathbf C}\) be any subset of \(S\) that generates \(S\) as an additive monoid. If \(T\) is a weakly cartesian functor, then the following conditions are equivalent:

  1. (i)

    diagram (2.4) is a weak pullback whenever \(A\) is a two-element set, and \(B\) is a one-element set;

  2. (ii)

    for every \(a, b, c, d\in S\) with \(c\) in \(\mathrm{\mathbf C}\) and \(cd = a+b\), there exists a map

    with

    $$\begin{aligned} \sum _{x+y=d} t(x,y)x = a,\;\sum _{x+y=d} t(x,y)y = b,\;\sum _{x+y=d} t(x,y) = c. \end{aligned}$$
    (8.11)

Proof

As follows from Observation 8.5(b), condition (i) holds if and only if diagram (8.6) is a weak pullback. Therefore condition (i) is equivalent to the equality \(M(S) = T(S)\), where \(M(S)\) is defined as in Lemma 8.8. Let us write \([d,c]\) for the map \(S\rightarrow S\) defined by \([d,c](d) = c\) and \(s\ne d \;\Rightarrow \; [d,c](s) = 0\). The set \(\{[d,c]\;|\;d\in S, c\in C\}\) generates \(T(S)\) as an additive monoid, and so the equality \(M(S) = T(S)\) holds if and only if every \([d,c]\), with \(d\in S\) and \(c\in C\), is in \(M(S)\). On the other hand, \([d,c]\) is in \(M(S)\) if and only if, for every \(a, b\in S\) with \(cd = a+b\), there exists \(t\in T^2(S)\) with

$$\begin{aligned} \sum _{(x,y)\in S\times S} t(x,y)x = a, \;\sum _{(x,y)\in S\times S} t(x,y)y = b,\;\sum _{x+y=d} t(x,y) = c,\;\sum _{x+y=s} t(x,y) = 0,\nonumber \\ \end{aligned}$$
(8.12)

for all \(s \ne d\). The fact that the existence of such a \(t\) (for all \(a, b, c, d\in S\) with \(c\) in \(C\)) is equivalent to condition (ii) follows from Lemma 6.1(iii). \(\square \)

Now we are ready to present our main results on the free \(S\)-semimodule monad \(T\):

Theorem 8.10

The free \(S\)-semimodule monad \(T\) satisfies (BC) if and only if the semiring \(S\) satisfies the following conditions:

  1. (a)

    no non-zero element in \(S\) has an additive inverse;

  2. (b)

    for every \(a, b, c, d\in S\) with \(a+b = c+d\), there exist \(x, y, z, t\) in \(S\) with

    $$\begin{aligned} x+y = a,\, z+t = b,\; x+z = c,\; y+t = d; \end{aligned}$$
  3. (c)

    \(S\) has no zero divisors, that is, \(ss'=0\;\Rightarrow \;(s=0\,\vee \,s'=0)\) for \(s, s'\in S\);

  4. (d)

    for every \(a, b, c, d\in S\) with \(cd = a+b\), there exists a map \(t : \{(x,y)\in S\times S\;|\; x + y = d\}\rightarrow S\) satisfying (8.11).

Furthermore, condition (d) can be replaced by:

  1. (e)

    there exists a subset \(C\) of \(S\) that generates \(S\) as an additive monoid, and such that, for every \(a, b, c, d\in S\) with \(c\) in \(C\) and \(cd = a+b\), there exists a map \(t : \{(x,y)\in S\times S\;|\; x+ y = d\}\rightarrow S\) satisfying (8.11).

Proof

Note that conditions (a)–(e) coincide with conditions 6.1(iii), 6.3(iii), 8.6(b), 8.9(ii) with \(C = S\), and 8.9(ii) respectively; for (e) \(\Leftrightarrow \) 8.9(ii) note, however, that “any” can be understood as either existential or universal quantifier—which follows from Lemma 8.9. After that all we need is to recall that the functor \(T\) satisfies (BC) if and only if \(S\) satisfies 6.1(iii) and 6.3(iii) (Theorem 6.4), and to put together Corollary 8.3 and Lemmas 8.6 and 8.9. \(\square \)

From here and Theorem 7.1 we obtain:

Theorem 8.11

The free \(S\)-semimodule monad \(T\) is weakly cartesian if and only if it satisfies (BC) and the implication \(s+s' = 1\;\Rightarrow \;(s = 0 \,\vee \, s' = 0)\) for \(s, s'\in S\).

9 Semimodules IV: examples

Conditions (b), (d), and (e) of Theorem 8.10 can be complicated to check, and we will use simple stronger conditions provided by

Proposition 9.1

  1. (a)

    If for every \(a, b\in S\) there exists \(x\in S\) such that either \(a+x = b\) or \(b+x = a\), and the additive monoid of \(S\) is a monoid with cancelation, then condition 8.10 (b) holds.

  2. (b)

    If every non-zero element in \(S\) has a multiplicative inverse and no non-zero element in \(S\) has an additive inverse, then condition 8.10(d) holds.

  3. (c)

    If \(S\) is generated by \(\{1\}\), then condition 8.10(e) holds.

Proof

(a): Let \(a, b, c, d\in S\) be as in Theorem 8.10(b). Without loss of generality we can assume that there exist \(p, q, r\in S\) with \(a+p = b\), \(a+q = c\), \(a+r = d\), after which we can take \(x = 0\), \(y = a\), \(z = c\), and \(t = r\). Indeed, we will obtain \(x+y = 0+a = a\), \(z+t = c+r = b\) since \(a+c+r = c+d = a+b\), \(x+z = 0+c = c\), and \(y+t = a+r = d\).

(b): Given \(a, b, c, d\in S\) with \(cd = a+b\), consider the two cases \(d = 0\) and \(d \ne 0\) separately. If \(d = 0\), then \(a = 0 = b\), and

defined by \(t(0,0) = c\) satisfies (8.11). If \(d \ne 0\), then \(t : \{(x,y)\in S\times S\;|\;x+y = d\}\rightarrow S\), defined by

$$\begin{aligned} t(d,0) = ad^{-1},\; t(0,d) = bd^{-1}, \; (d,0) \ne (x,y) \ne (0,d) \;\Rightarrow \; t(x,y) = 0,\qquad \end{aligned}$$
(9.1)

also satisfies (8.11).

(c): We take \(C = \{1\}\) and then to take \(a, b, c, d\in S\) with \(c\) in \(C\) and \(cd = a + b\) is to take \(a, b, d\in S\) with \(d = a + b\). Then \(t : \{(x,y)\in S\times S\;|\; x+y = d\}\rightarrow S\), defined by

$$\begin{aligned} t(a,b) = 1 \hbox { and } (x,y) \ne (a,b) \;\Rightarrow \;t(x,y) = 0, \end{aligned}$$

satisfies (8.11). \(\square \)

Example 9.2

If \(S = {\mathbb {N}}\) is the semiring of natural numbers, then all our conditions are satisfied. In particular for conditions 8.10(b) and 8.10(e) this follows from Propositions 9.1(a) and 9.1(c) respectively. In this case \(S\)-semimodules are nothing but commutative monoids, and we conclude that the free commutative monoid monad is weakly cartesian. Note that it is not cartesian by Proposition 5.1.

Example 9.3

Let \(S = \{0,1\}\) be the quotient semiring of \({\mathbb {N}}\) determined by \(1+1 = 1\). Then the \(S\)-semimodules are nothing but commutative idempotent monoids, and so, as trivially follows from Theorem 8.11, the free commutative idempotent monoid monad is not weakly cartesian. However, it satisfies (BC), since \(S = \{0,1\}\) satisfies all conditions of Theorem 8.10. For condition 8.10(e) it follows again from Proposition 9.1(c), while condition 8.10(b) can be easily checked directly. Note that Proposition 9.1(a) cannot be used here since the additive monoid of \(S\) does not admit cancelation. The fact that the free commutative idempotent monoid monad satisfies (BC) is in fact known and was already mentioned e.g. in [8].

Example 8.4

In spite of Examples 9.2 and 9.3, the “intermediate” semiring \(S = \{0,1,2\}\), having \(2+1 = 2\), does not satisfy condition 8.10(b), and so in this case the free \(S\)-semimodule functor does not even preserve weak pullbacks. Indeed, for \(a = b = 1\) and \(c = d = 2\), we have \(a + b = c + d\), but there is no \(x, y, z, t\) in \(S\) with

$$\begin{aligned} x+y = a, \; z +t = b,\; x+z = c, \; y+t = d. \end{aligned}$$

This shows the cancelation requirement cannot be removed in Proposition 9.1(a), as Example 9.3 might seem to suggest.

Example 8.5

If \(S = {\mathbb {Q}}_+\) is the semiring of non-negative rational numbers, then, as in Example 9.3, the free \(S\)-semimodule monad is not weakly cartesian (again, trivially from Theorem 8.11) but satisfies (BC). However, Proposition 8.10(c) cannot be used here, and Proposition 9.1(b) should be used instead, which makes condition 8.10(d) trivial. The same is true if \(S = {\mathbb {R}}_+\) is the semiring of non-negative real numbers.

Example 8.6

If the semirings \(S\) and \(S'\) satisfy conditions 8.10(a) and 8.10(b), then the product \(S\times S'\) also does. But it never satisfies condition 8.10(c) unless one of the two rings is trivial. This gives free semimodule monads which do not satisfy (BC) although the functor \(T\) does.