1 Semantic Constraints on Spatial Prepositions

Prepositions are a limited class of words, but with a wide range of meanings and uses, even if we consider only locative prepositions in one language, like English in, near, over or behind (Lindstromberg 2010).

  1. (1)

    a. There is a fly in my soup.

    b. The bomb went off near a guest house.

    c. Install the door knob over the key slot.

    d. Keep behind the line.

These locative prepositions define a relation between a trajector and landmark, by identifying a particular ‘region of acceptability’ or ‘search domain’ with respect to the landmark where the trajector can be found (Logan and Sadler 1996; Levinson 1996). Different geometries can be the basis of a preposition’s meaning, like topology or projective geometry (Crangle and Suppes 1989) with different frames of reference, making use of deictic, intrinsic or environmental features (Levinson 1996). Various non-geometric factors (like function and force-dynamics) also play an important role (e.g. Herskovits 1986; Vandeloise 1991; Bowerman 1996; Feist and Gentner 1998; Coventry and Garrod 2004; Carlson and van der Zee 2005; Gärdenfors 2014).

Since prepositions are at the crossroads of spatial cognition, linguistic expression, and computational application, this variety and flexibility has been the topic of many theoretical and experimental studies by psychologists, linguists, and information scientists, within and across languages. This research has increased our understanding of the many factors that play a role in prepositional meanings and the way this might differ between languages.Footnote 1

However, this also raises the much less investigated question whether and how meanings of prepositions are constrained, that is, how all the different spatial and other factors come together in such a way that they can be learned, represented, and used by human beings. Although this is a general question about word meanings, it is especially relevant for prepositions, given their importance at many different levels of lexicon, grammar, and discourse. Landau and Jackendoff (1993), Regier (1996), and Zwarts and Winter (2000) pay attention to the constraints on prepositional meanings from different perspectives. In this article, we approach this issue from the perspective of conceptual spaces, with a focus on the geometric constraint of convexity (Gärdenfors 2000, 2014), which has already been shown to be at work in a range of different domains. In this approach, word meanings correspond to convex regions (subsets of points) in a conceptual space, which are defined as follows (Gärdenfors 2000, p. 69):

  1. (2)

    A subset C of a conceptual space S is said to be convex if, for all points x and y in C, all points between x and y are also in C.

One concrete example of the constraint of convexity is found in the domain of color words. Jäger (2010) has shown that for the languages included in the World Color Survey there is strong evidence that color words correspond to convex regions of the color space.

Although applying convexity to locative prepositions might seem a simple matter, it is far from trivial, because it depends heavily on the assumptions that we make about the structure and representation of space. Although definition (2) suggest that there is a straightforward and intuitive notion of convexity, we believe that the semantics of prepositions shows that convexity depends on the way spatial positions are represented in relation to a particular origo. Our goal in this paper is therefore not empirical, but theoretical: building on what is known about prepositions, we will show that, when using polar coordinates, a particular version of convexity holds quite generally in the domain of prepositions and it can serve as a guide to alternative semantic analyses in those cases where it does not seem to hold.

The structure of the paper is as follows. Section 2 briefly sketches the background of conceptual spaces in relation to the semantics of prepositions. We then spell out our model of polar coordinates in Sect. 3, which we use to define the meanings of locative prepositions in Sect. 4. In Sect. 5 we demonstrate to what extent the resulting meanings are convex with respect to a new notion of polar betweenness. We extend our model to directional prepositions in Sect. 6.

2 Conceptual Spaces, Convexity, and Locative Prepositions

In order to model the meaning of prepositions we need to make some general assumptions, first about the role of regions and convexity in conceptual spaces and then about the way prepositions are treated.

As we already saw, an important notion in the semantic theory of Gärdenfors (2000, 2014) is that of convexity, the thesis that concepts can be represented as convex regions of some conceptual space. A fact that is often overlooked is that there are several ways of conceiving of betweenness. In traditional Euclidean geometry, the points between the points a and c are the points that lie on the straight line between a and c. For a 3-dimensional Cartesian space with coordinates x, y and z, the notion of betweenness can be more formally defined as follows:

  1. (3)

    A point \(b=\langle x_{b},y_{b},z_{b}\rangle \) lies between a point \(a=\langle x_{a}, y_{a}, z_{a}\rangle \) and a point \(c=\langle x_{c}, y_{c}, z_{c}\rangle \) if there is some \(k, 0<k < 1\) such that \(x_{b}= kx_{a}+(1-k)x_{c}\), \(y_{b}= ky_{a}+(1-k)y_{c}\) and \(z_{b}= kz_{a}+(1-k)z_{c}\).

As we will see, however, using polar coordinates, another notion of betweenness can be defined and thereby another notion of convexity.

Spatial prepositional phrases (PPs) like in my soup, near a guest house, over the key slot, and behind the line can also be taken to correspond to regions in a conceptual space, physical space in this case. What kind of points do these regions consist of? One idea is that the region corresponding to a spatial PP consists of a set of vectors. The use of vectors in the semantics of prepositions has been advocated from different points of view: to account for their relation with cognitive maps (O’Keefe 1996, 2003), for the way they are modified (e.g. three feet above, right behind, Zwarts 1997; Zwarts and Winter 2000), the role that attention plays in their use (Regier and Carlson 2001), for modeling their frames of reference and points of view (Bohnemeyer 2012), and their relation with syntactic structures (Svenonius 2006, 2008; Kracht 2008). The notion of a vector is a simple but powerful way to represent the position of the trajector relative to the landmark. Its starting point is somewhere in the landmark and its end point is somewhere in the trajector and the preposition imposes restrictions on the length and direction of the vector. A prepositional phrase like below the lamp corresponds then to the set of vectors that have their origin in the lamp and that point downward, each representing a potential position for a trajector in that region. We assume that all prepositions that have a spatial, geometric meaning can be represented using vectors in this way.

However, there are also many prepositions, some of them quite basic, for which it has been argued that they are not based on geometric factors, but on force-dynamic or functional factors (for example, Herskovits 1986; Vandeloise 1991; Dewell 1994; Bowerman 1996; Garrod et al. 1999; Tyler and Evans 2001; Beliën 2002; Coventry et al. 2001; Coventry and Garrod 2004; Carlson and van der Zee 2005; Zwarts 2010; Gärdenfors 2014). There is debate in the literature about whether prepositions like at, in and on, and over and under are defined entirely in terms of non-geometric factors, or whether their meaning involves a combination of geometric and functional elements. Here we follow Gärdenfors (2014) in assuming that these prepositions are not spatial in nature, but force-dynamic. We apply our analysis to locative prepositions that are more clearly spatial (although functional factors can also play a role for these prepositions too): the topological prepositions inside and outside, the distance prepositions near and far, the projective (or axial) prepositions above, below, in front of, behind, beside, and the preposition between. In Sect. 6, we then turn to directional prepositions, like to, from, and through.

3 Spatial Representation Using Polar Coordinates

Most commonly, space is represented with the aid of the Cartesian coordinates x, y and z, representing ‘width’, ‘depth’ and ‘height’ and where distances are measured using a Cartesian metric. However, there is another way of representing space, namely in terms of polar coordinates, which represent points in space in terms of distance and angles.

It is important to note that we are not analyzing the metaphysics of space, but how humans cognitively represent space. We operate with several perspectives on space, the most important being the egocentric, where space is seen from the point of view of the speaker, and the allocentric, where space is seen “from outside”. An interesting question, that seems to have been scarcely addressed, is what coordinate system best represents our spatial cognition. It seems plausible that our brains actually uses both types of coordinates (see Gallant et al. 1993 for evidence that some cells in macaque brains code for polar coordinates and some for Cartesian coordinates).

Fig. 1
figure 1

Representation of point p in polar coordinates \(\langle r,{\uptheta },\upvarphi \rangle \). “3D Spherical 2” by Dmcq—Own work. Licensed under CC BY-SA 3.0 via Wikimedia Commons—https://commons.wikimedia.org/wiki/File:3D_Spherical_2.svg#/media/File:3D_Spherical_2.svg

Let us present how a three-dimensional space S can be defined in terms of polar coordinates, or more precisely, spherical coordinates (e.g. Marsden and Tromba 1981, p. 44). It is assumed that the space has an origo point o and two axes x and z. As illustrated in Fig. 1, a point p is represented as a vector relative to that origo and those axes x and z as a triple \(\langle r,{\uptheta },\upvarphi \rangle \):

  1. (4)

    r (the radius) is a real number (with \(r \ge 0\)) representing the distance of p from the origo o,

    \({\uptheta }\) (the azimuth angle) is the angle (with \({0^{\circ }} \le {\uptheta }< 360^{\circ })\) between p and the x-axis,

    \(\upvarphi \) (the polar angle) is the angle (with \(0^{\circ } \le \upvarphi \le 180^{\circ }\)) between p and the positive z-axis (the zenith).

Notice that when the polar angle \(\upvarphi \) is \(0^{\circ }\) (straight ‘up’) or \(180^{\circ }\) (straight ‘down’), then the value of \({\uptheta }\) is arbitrary and that \({\uptheta }\) and \(\upvarphi \) are both arbitrary when \(r=0\), that is, at the origo. We assume that the angle \({\uptheta }\) goes counterclockwise when seen from the positive side of the z-axis (from ‘above’), so that the negative y-axis corresponds to \({\uptheta }= 90^{\circ }\) and the positive y-axis to \({\uptheta }=270^{\circ }\). In geographical applications of polar coordinates, the x-axis represents ‘north’ and the z-axis represents ‘up’, as the fixed reference directions relative to which other angles are defined. We will not make such an assumption, but assume that the x-axis and z-axis are fixed in an arbitrary way, independently of linguistically relevant directions like ‘up’, and introduce such directions explicitly in Sect. 4.

We can assign to every pair of points a and b the shortest angle \(\angle { ab}\) (in degrees) between them.Footnote 2 This angle is \(0^{\circ }\) for points that have the same angular coordinates. It is \(180^{\circ }\) for points a and \(-a\) that are opposite and \(90^{\circ }\) for points that are perpendicular to each other. These notions will come in useful in Sect. 4.

Given the representation of polar coordinates, one can define a notion of polar betweenness (p-betweenness) that is different from the one generated by the standard Cartesian metric:

  1. (5)

    A point \(b=\langle r_{b},{\uptheta }_{b,}{\upvarphi }_{b}\rangle \) lies polarly between (p-between) a point \(a=\langle r_{a},{\uptheta }_{a},{\upvarphi }_{a}\rangle \) and a point \(c=\langle r_{c},{\uptheta }_{c},{\upvarphi }_{c}\rangle \) if there is some \(k, 0 < k < 1\) such that

    $$\begin{aligned}&r_{b} = kr_{a} + (1-k)r_{c},\\&{\uptheta }_{b }=k{\uptheta }_{a }+ (1-k){\uptheta }_{c }\quad \hbox {iff } {\vert }{\uptheta }_{a}-{\uptheta }_{c}{\vert } < 180^{\circ },\quad \hbox {and } {\uptheta }_{b }=k{\uptheta }_{a}+ (1-k)({\uptheta }_{c}-360^{\circ })\\&\quad \hbox {iff} {\vert }{\uptheta }_{a}-{\uptheta }_{c}{\vert }>180^{\circ }, \end{aligned}$$

    and

    $$\begin{aligned}&{\upvarphi }_{b }=k{\upvarphi }_{a }+ (1-k){\upvarphi }_{c}. \end{aligned}$$

The division into two cases in the criterion for \({\uptheta }_{b}\) derives from the intuition that p-betweenness can only be defined for the shortest distance along the \({\uptheta }\)-dimension. Otherwise, any point would lie between any two points on a circle. For example, when \({\uptheta }_{a}=45^{\circ }\) and \({\uptheta }_{c} = 270^{\circ }\), then the second part of the definition in (5), when applied with \(k=0.5\), gives a \({\uptheta }_{c}\) coordinate \(-22.5\) (= 337.5), which is halfway between \(45^{\circ }\) and \(270^{\circ }\) taking the shortest way. Note that the definition leaves betweenness undefined in the case when \({\vert }{\uptheta }_{a}-{\uptheta }_{c}{\vert } = 180^{\circ }\), when ‘the shortest distance’ between \({\uptheta }_{a}\) and \({\uptheta }_{c}\) is undefined. In order to apply the definition also for the case when a (or c) is the origo, where the values of \({\uptheta }\) and \(\upvarphi \) are undefined, we may put \({\uptheta }_{a }={\uptheta }_{c }\) and \({\upvarphi }_{a }={\upvarphi }_{c}\).

The polar coordinates introduce a different geometry on the space, compared with the standard Cartesian coordinates. Consequently, the ‘line segments’ generated by this polar betweenness relation can be ‘curved’, if seen with Cartesian glasses.Footnote 3 This is illustrated in Fig. 2 for the two-dimensional case (omitting the polar angle \(\upvarphi \)) for three situations A, B, C, in which the open circles are ‘b’ points p-between the ‘a’ and ‘c’ points represented by the black circles. The A situation shows that the points p-between a and c form a radial line if a and c differ only in radius. If they differ only in angle, then the points p-between them lie on a circle, as in situation B. If both angle and radius differ, then the p-between points describe a curve. To be more specific, in that case these points lie on an Archimedean spiral.

Fig. 2
figure 2

Polar betweenness

Two points at (almost) opposite sides of the origo are related by a curve that goes around the origo, as shown in Fig. 3. This illustrates the crucial difference with ordinary betweenness that will turn out to be relevant for the convexity of prepositions.

Fig. 3
figure 3

P-betweenness around the origo

Consider now a region R in S, represented as a set of points each defined as a polar coordinate with respect to one and the same origo o.

  1. (6)

    A subset R of S is polarly convex (p-convex), if and only if, for all points a and c in R, any point b in S that is p-between a and c is also in R.

One thing to note about p-convexity, in comparison to Cartesian convexity, is under what coordinate transformations convexity is preserved. Cartesian convexity is preserved under multiplications, translations and rotations of the coordinate system. In contrast, polar convexity only preserves convexity under multiplications (changing the values of the r coordinate) and rotations (changing the values of the \(\upvarphi \) and \({\uptheta }\) coordinates). If a translation occurs, i.e. if the origo moves, then convexity may not be preserved. This underlines the important role that the origo plays in polar coordinate systems.

What kind of regions in three-dimensional space are p-convex? We will not give a full and systematic overview, but mention prominent examples that will be important later on for the p-convexity of prepositions (Sect. 5). A sphere is p-convex, but only if the origo of its polar coordinates lies in its center. The same is true for the interior of a sphere, its exterior, and the area between two spheres with different radii but the same origo. A full line or full plane is p-convex if it contains the origo. A half-line or line segment is p-convex if it is on the same line as the origo. A half-space or half-plane is p-convex if the origo is a point on the bounding plane or line. An infinite cone is p-convex with respect to its apex.

The intersection of two p-convex regions \(R_{1}\) and \(R_{2}\) is also p-convex, but only if the p-convexity of the two regions is based on the same origo. The intersection of a sphere with a half-space that goes through the origo is a hemisphere, for instance, which is also p-convex. Proof: Let \(R_{1}\) and \(R_{2}\) be two p-convex regions with the same origo and \(a,c \in R_{1} \cap R_{2}\). Therefore, \(a,c \in R_{1}\) and \(a,c\in R_{2}\). Let b be p-between a and b. Then \(b \in R_{1}\) and \(b \in R_{2}\) because \(R_{1}\) and \(R_{2}\) are both p-convex. Therefore \(b\in R_{1}\cap R_{2}\).

Regions with other familiar geometric shapes are not p-convex, even though they might be convex in terms of Cartesian coordinates, like a triangle, cube, or cylinder. A double cone is an example of a region that is neither convex in the Cartesian, nor in the polar sense. Suppose that a and b are points in the region of some of these shapes, then the curve of p-between points connecting a and b may not be entirely contained in the region, even if the origo o has a central location. This is illustrated in Figs. 4 and 5.

Fig. 4
figure 4

Non-p-convexity of rectangle

Fig. 5
figure 5

Non-p-convexity of double cone

The polar convexity defined above generalizes the two different continuity constraints that Zwarts (1997: 81) proposed for the region corresponding to a PP, and that are based on two different betweenness relations between non-zero vectors that are members of the same vector space. Roughly speaking, a vector v is linearly between u and w if a situation as in the left part of Fig. 6 obtains and radially between two vectors u and w in a situation like in the right part (Zwarts 1997, p. 81).

Fig. 6
figure 6

Linear and radial betweenness of vectors (from Zwarts 1997, p. 81)

A region R corresponding to a locative PP is linearly (radially) continuous iff for every \(u,w\in R\), if v is linearly (radially) between u and w, then \(v \in R\). Zwarts argues that all locative PPs correspond to regions that are both linearly and radially continuous. Although he uses the term continuity, it is clear that these constraints are directly relevant for the convexity of prepositional meanings in conceptual spaces (Gärdenfors 2000, p. 172). However, the problem is that continuity is not a general and unified notion, but comes in two different versions that are conceptually related, but not formally. As a consequence, the underlying notion of betweenness and its consequences for the convexity of prepositional regions remains unclear. By analyzing vectors in terms of polar coordinates a notion of polar betweenness and polar convexity becomes available that can unify and strengthen Zwarts’ notion of continuity.

There is much more to say about cognitive and computational aspects of spatial representations. Some would argue that it is more natural and efficient to have qualitative instead of quantitative representations, with primitive regions and lines, instead of points (Cohn and Renz 2008). We agree that it would be interesting to see to what extent qualitative representations are sufficient to represent the semantics of prepositions, but here we will confine ourselves to some brief remarks. A main thesis of this paper is that most locational and directional prepositions can be represented by regions that are convex, where betweenness is defined in terms of polar coordinates. Thus the notions of betweenness and convexity are central to our analysis. It is true that these notions can be incorporated without assuming a metric, for example in ordered sets (for a survey see Cohn and Renz 2008). However, for many systems of qualitative reasoning, for example the Region Connection Calculus (Cohn et al. 1997) and the Qualitative Trajectory Calculus (van de Weghe et al. 2005), the notion of convexity cannot be expressed. On the other hand there are other systems of qualitative reasoning where convexity is added as an operator C so that the convex hull of a region X is the region C(X) (e.g. Randell et al. 1992; Davis 2006). In these systems, the convex regions X can be identified by the fact that they satisfy the identity \(\hbox {C}(X)=X\).Footnote 4 A main difference is, however, that the notion of polar convexity is not discussed in any of these systems.

In a sense, our semantics is qualitative since all of our descriptions of the regions of prepositions are based on qualitative criteria for polar coordinates, like their being inside or outside the landmark, near or far, above or below. So in principle, we do not need the metric generated by the polar coordinate system, but only the notion of polar betweenness (with respect to an origo), from which the set of polarly convex sets then can be defined. The important difference from previous analyses is that the set of polarly convex sets is not the same as for Cartesian betweenness.

4 Semantic Definitions of Locative Prepositions in Polar Coordinates

We noted earlier that a locative preposition describes a relation between a trajector and a landmark.Footnote 5 This relation can be broken up into two parts (e.g. Jackendoff 1983; Wunderlich 1991). The first part is one of the many different spatial functions that map a landmark to a region, corresponding to different prepositional meanings. The second part is a general function that locates the trajector in that region. In the remainder we will restrict ourselves to the prepositional functions that map landmarks to regions. For example, if L is a landmark, then near(L) will be the region of points that are in proximity to L.

We will make the following assumptions and idealizations about how landmarks and regions relate to the spatial domain S that we defined in the previous section. First, we assume that the center of mass of the landmark will function as the origo of S (but see Zwarts and Winter 2000; Regier and Carlson 2001; Mador-Haim and Winter 2015 for much more realistic models). In order to make explicit that a landmark L fixes the origo of the space S in which the preposition is interpreted, we write S(L) for the polar coordinate system defined by landmark L. So, if \(L_{1}\) and \(L_{2}\) are landmarks at different locations in space, then \(S(L_{1})\) and \(S(L_{2})\) are different coordinate systems because of their different origos. In \(S(L_{1})\), the coordinate \(\langle r,{\uptheta },\upvarphi \rangle \) represents a different point than in \(S(L_{2})\).

Second, we will consider only spherical landmarks, which is of course a strong idealization, but one which for the meanings of most locative prepositions does not result in any major distortions. Clearly, for a full account of spatial semantics, we ultimately need a model that can deal with landmarks of any shape, but we believe that the properties of most prepositions can be studied and modeled more easily with a more restricted type of landmark.Footnote 6 After all, we are not interested in how the shape of a region is determined by the accidental shape of a landmark, but by the lexical properties of a preposition. Moreover, schematizations of different kinds play an important role in spatial language (Herskovits 1998). Third, we assume that the set of points occupied by the landmark is a closed subset of S(L), that is, it contains its boundary.

We now show that by using polar coordinates, locative prepositions can be given a systematic description that also brings out explicitly the spatial features of each preposition. The coordinates of a polar system provide different ways of defining regions in the space S(L) of a landmark. The main distinction is between prepositions that refer to the radius of a point and prepositions that refer to the angles \({\uptheta }\) and \(\upvarphi \).

The first distinction based on the radius coordinate is that between internal and external regions, corresponding to the prepositions inside and outside, respectively. Given the idealizations that we have made, we can define these regions as sets of points of which the radius x is either smaller than the radius r\(_{L}\) of the landmark L (inside) or larger (outside), as illustrated in Figs. 7 and 8.

  1. (7)

    inside\((L) = \{ \langle x,{\uptheta },\upvarphi \rangle \in S(L): x\le \hbox {r}_{L }\}\)

    outside\((L) = \{ \langle x,{\uptheta },\upvarphi \rangle \in S(L): x>\hbox {r}_{L} \}\)

The corresponding regions (or at least their two-dimensional cross-sections) can be diagrammed by shading the area where the points of the regions (that is, the endpoints of the vectors) are.

Fig. 7
figure 7

Inside region

Fig. 8
figure 8

Outside region

Without the idealization of a circular landmark, the definitions in (7) would be more complicated, but not crucially different. It would have to refer to the boundary points of a convex landmark L in the following way. If \(\langle b,{\uptheta },\upvarphi \rangle \) is a point on the boundary of L, then a point \(\langle x\mathbf{,}{\uptheta },\upvarphi \rangle \) is inside L if \(x\le b\) and outside L if \(x > b\). If landmarks are not convex, then it becomes much more complicated to determine what is inside and outside (see Ullman 1984, for cognitive issues and Mador-Haim and Winter 2015, for semantic issues).

Next, the space outside the landmark can be divided into points that are near or far, as shown in Figs. 9 and 10.Footnote 7 Of course, at what distance from the landmark the division between ‘near’ and ‘far’ is made is dependent on the context, in particular the size of the landmark: what is near the sun in outer space covers a much larger distance than what is near a golf ball on the ground. Moreover, the division is not sharp but might allow for a vague gap between near and far.Footnote 8

The definitions of near and far are given in (8) as sets of polar coordinates, for a given landmark with radius \(\hbox {r}_{L}\) and a contextually given norm \(\hbox {c}_{L}\) that represents the demarcation between ‘near’ and ‘far’ relative to L.

  1. (8)

    near\((L) = \{ \langle x,{\uptheta },\upvarphi \rangle \in S(L): \hbox {r}_{L} <x < \hbox {c}_{L} \}\)

    far\((L)=\{ \langle x,{\uptheta },\upvarphi \rangle \in S(L): \hbox {c}_{L} < x \}\)

Then there are prepositions that refer to the angular coordinates. They are based on a frame of reference that defines a region at a particular side of the landmark. There is a rich literature about the different frames of reference, with different terminologies (see Levinson 1996, for an overview). We follow here Levinson, who makes a distinction between absolute frames of reference (based on features of the environment, such as gravitation and compass points), intrinsic frames of reference (based on features of the landmark itself, like its ‘active’ side), and relative frames of reference (based on the position of an observer looking at the landmark).

A frame of reference can be represented by a point (a ‘unit vector’) \(\langle 1,{\uptheta }_{0},{\upvarphi }_{0}\rangle \) with particular values for \({\uptheta }_{0}\) and \({\upvarphi }_{0}\). An absolute frame of reference is a direction that is given independently of any landmark or observer, as a feature of the environment shared by all landmarks under every perspective. For example, this unit vector can be taken to be \({ grav} = \langle 1,{\uptheta }_{g},{\upvarphi }_{g}\rangle \), which is fixed as an absolute frame of reference by the downward direction of gravitation. The prepositions above and below include those points in the region that are close to the unit vector that represents the corresponding vertical axis.

  1. (9)

    above\((L) = \{ \langle x,{\uptheta },\upvarphi \rangle \in S(L): x > \hbox {r}_{L}\) and \(\langle x,{\uptheta },\upvarphi \rangle \) is close to –grav}

    below\((L) = \{ \langle x,{\uptheta },\upvarphi \rangle \in S(L): x > \hbox {r}_{L}\) and \(\langle x,{\uptheta },\upvarphi \rangle \) is close to grav}

Fig. 9
figure 9

Near region

Two points a and b are close if there is a degree d such that \(\angle { ab} < d\). The closeness in (9) is a matter of context and vagueness, but for the sake of concreteness we will simply assume that \(d=45^{\circ }\). This yields the infinite cones that we see in Figs. 11 and 12 (showing two-dimensional cross-sections of regions that are actually three-dimensional).

An intrinsic frame of reference is a way of assigning directions that depends on features of the landmark, like where its face and senses are (for humans and animals) or where its interactive side is (for animals). For a landmark L at a particular moment of time, \(\hbox {front}_{L}\) gives the unit vector \(\langle 1,{\uptheta }_{F},{\upvarphi }_{F}\rangle \) in which L is ‘facing’ at that moment. In its canonical orientation, \(\hbox {front}_{L}\) will be horizontal, i.e. orthogonal to the absolute direction given by the gravitational vector. Similarly, we can assume a function \(\hbox {right}_{L}\) that assigns a direction orthogonal to \(\hbox {front}_{L}\). Notice that these intrinsic frames of reference are functions of the landmark L. The intrinsic meaning of in front of, behind, right of, and left of can then be defined in (10).

  1. (10)

    \(\hbox {in-front-of}_\mathrm{int}(L) = \{ \langle x,{\uptheta },\upvarphi \rangle \in S(L): x > \hbox {r}_{L}\) and \(\langle x,{\uptheta },\upvarphi \rangle \) is close to front\(_{L}\}\)

    \(\hbox {behind}_\mathrm{int}(L) = \{ \langle x,{\uptheta },\upvarphi \rangle \in S(L): x > \hbox {r}_{L}\) and \(\langle x,{\uptheta },\upvarphi \rangle \) is close to –front\(_{L}\}\)

    \(\hbox {right-of}_\mathrm{int}(L) = \{ \langle x,{\uptheta },\upvarphi \rangle \in S(L): x > \hbox {r}_{L}\) and \(\langle x,{\uptheta },\upvarphi \rangle \) is close to right\(_{L}\}\)

    \(\hbox {left-of}_\mathrm{int}(L) = \{ \langle x,{\uptheta },\upvarphi \rangle \in S(L): x > \hbox {r}_{L}\) and \(\langle x,{\uptheta },\upvarphi \rangle \) is close to –right\(_{L}\}\)

The corresponding regions are cone-shaped again, like those of above and below.

Fig. 10
figure 10

Far region

Fig. 11
figure 11

Above region

Fig. 12
figure 12

Below region

Another preposition that can be based on an intrinsic frame of reference is beside. Following Leech (1974) and Svenonius (2008) we analyze beside here as involving proximity to the sides of the landmark. It seems reasonable to assume that the landmark has a left and right side and that the definition of beside (and next to, its synonym for our purposes) should be as follows:Footnote 9

  1. (11)

    \(\hbox {beside}(L) = \{ \langle x,{\uptheta },\upvarphi \rangle \in S(L): \hbox {r}_{L} < x \le \hbox {c}_{L}\) and \(\langle x,{\uptheta },\upvarphi \rangle \) is close to right\(_{L}\) or to –right\(_{L}\}\)

What makes this region different from the regions of the (other) axis-based prepositions is that it stretches out in two opposite directions, as shown in Fig. 13.

Fig. 13
figure 13

Beside region

The prepositions in front of, behind, right, and left can also be interpreted with respect to a relative frame, which can be represented by making the front and right sides of L dependent on an observer O. \(\hbox {Front}_{L,O}\) gives the side of L that is oriented towards O and \(\hbox {right}_{L,O}\) gives the side of L that has the same orientation as O.

  1. (12)

    \(\hbox {in-front-of}_\mathrm{rel}(L) = \{ \langle x,{\uptheta },\upvarphi \rangle \in S(L): x > \hbox {r}_{L}\) and \(\langle x,{\uptheta },\upvarphi \rangle \) is close to front\(_{L,O}\}\)

    \(\hbox {behind}_\mathrm{rel}(L) = \{ \langle x,{\uptheta },\upvarphi \rangle \in S(L): x > \hbox {r}_{L}\) and \(\langle x,{\uptheta },\upvarphi \rangle \) is close to –front\(_{L,O}\}\)

    \(\hbox {right-of}_\mathrm{rel}(L) = \{ \langle x,{\uptheta },\upvarphi \rangle \in S(L): x > \hbox {r}_{L}\) and \(\langle x,{\uptheta },\upvarphi \rangle \) is close to right\(_{L,O}\}\)

    \(\hbox {left-of}_\mathrm{rel}(L) = \{ \langle x,{\uptheta },\upvarphi \rangle \in S(L): x > \hbox {r}_{L}\) and \(\langle x,{\uptheta },\upvarphi \rangle \) is close to –right\(_{L,O}\}\)

We turn to the preposition between, which is, in its most prominent use, based on two landmarks, as in the prepositional phrase between John and Mary (Habel 1989; van der Zee and Watson 2004). There are two origos for the polar coordinates defined by this PP, John and Mary, and therefore two polar coordinate systems, S(John) and S(Mary). Consider a point between John and Mary. This point corresponds to coordinates \(\langle x,{\uptheta },\upvarphi \rangle \in S\)(John) such that \(\langle x,{\uptheta },\upvarphi \rangle \) is directed towards Mary; it also corresponds to coordinates \(\langle x',{\uptheta }',\upvarphi '\rangle \in S\)(Mary) such that \(\langle x',{\uptheta }',\upvarphi '\rangle \) is directed towards John. We can say that \(\langle x,{\uptheta },\upvarphi \rangle \) and \(\langle x',{\uptheta }',\upvarphi '\rangle \) describe the same point in space, but from different origos. This leads us to the following quite rudimentary definition of between, where dir(\(L,L'\)) is the vector in S(L) that represents the point of \(L'\) closest to L, as defined in (13), and \(\langle x,{\uptheta },\upvarphi \rangle \approx \langle x',{\uptheta }',\upvarphi '\rangle \) represents that those two coordinate triples represent the same point in space, even though they are defined with respect to different origos.

  1. (13)

    dir(\(L,L'\)) = the shortest vector \(\langle x,{\uptheta },\upvarphi \rangle \in \) outside(L) for which there is a \(\langle x',{\uptheta }',\upvarphi '\rangle \in \) inside(\(L'\)) such that \(\langle x,{\uptheta },\upvarphi \rangle \approx \langle x',{\uptheta }',\upvarphi '\rangle \)

  2. (14)

    between-and \((L,L') = \{ \langle x,{\uptheta },\upvarphi \rangle \in S(L)\): \(\hbox { dir}(L,L') = \langle d,{\uptheta },\upvarphi \rangle \hbox { and }x < d \}\)

In this case the region between-and(\(L,L'\)) consists of the points lying on the shortest line segment connecting the two landmarks L and \(L'\), as illustrated in Fig. 14.

Fig. 14
figure 14

The region between two landmarks L and \(L'\)

Note that between(\(L,L'\)) and between(\(L',L\)) correspond to the same line, but with different origos. Obviously, this is a very narrow definition, which needs to be elaborated, but it captures the essential nature of between. One way to extend the definition is to allow a wider (vague) region between the landmarks. Another extension is that between can refer to multiple landmarks as in the expression between the trees. In the latter case, the convex hull of the landmarks would be a plausible representation of the meaning (presuming a unique origo in relation to which polar convexity is defined).

In summary, the space around a landmark can be represented in terms of polar coordinates and a locative preposition carves out a region in this space by imposing simple conditions on the values of the polar coordinates. We will now consider whether the regions defined in this way are also polarly convex.

5 The Convexity of Locative Prepositions

Given the notions of polar betweenness and polar convexity that we defined in Sect. 3, the question is now whether the regions of locative prepositions that we defined in Sect. 4 are convex in this sense.

The preposition inside is trivially p-convex, because of the idealizing assumption about landmarks we have made. The region corresponding to inside is always the interior of a circle, which is p-convex (as well as convex in the Cartesian sense, of course). In reality, inside can be used with landmarks of any shape, like inside the cathedral, with the cathedral having the shape of a cross, and the resulting interior region is not convex or p-convex in that case. We are interested in the general properties that prepositional categories have, independently of the more accidental properties of the landmarks that they are based on. Inside is not relevant for studying such properties, because the shape of its region is completely determined by the landmark that it applies to. The ‘exterior’ prepositions are where p-convexity has to be put to the test.

Consider therefore outside, near, and far, that are problematic for the ordinary, Cartesian notion of convexity, because there is a gap in the center of the region. If position a to the east of my house is outside (near, far) and a position b to the west is outside (near, far), then there are definitely positions between a and b that are not outside (near, far). But this description depends on Cartesian betweenness. If we apply p-betweenness to these regions, then they are p-convex in terms of polar coordinates. As we saw in Sect. 3, the exterior of a sphere (the shape of outside and far regions) is p-convex and also the region between two spheres (the shape of near regions). Crucially, in these cases the curved nature of the p-convexity line between the two points a and b leads around the gap in the middle. Hence, a point p-between a to the east and b to the west is a point either to the north or to the south (unless a and b are on diametrically opposite sides, because then p-betweenness is not defined, see Sect. 3).

The regions defined by the ‘angular’ prepositions above, below, in front, behind, left, and right are all p-convex according to the definitions of Sect. 4, because they are the intersection of two basic p-convex regions: the exterior of a sphere (because of the condition that \(x > \hbox {r}_{L}\)) and an infinite cone (because of the condition that the angular coordinates are close to the unit vector that represents the axis). What frame of reference underlies the preposition does not affect the p-convexity. What is important for the p-convexity of these regions is that they are located at one side of the landmark only. Regions defined by between are p-convex because between is taken very strictly, resulting in a line segment between the two landmarks. We are left then with the p-convexity of the preposition beside, for which we defined a region that consists of two parts on opposite sides of the landmark. Given definition (11) this region is not p-convex (see Fig. 13). However, we do not want to conclude that beside falsifies p-convexity, until we have explored alternative ways of defining its meaning. As Leech (1974) points out for the synonym at the side of:

In a very general sense, at the side of has no reference to vertical or horizontal orientation: in an astronomical context, Star x is at the side of star y may mean no more than that the two stars are in spatial proximity. A less general sense of at the side of is found in I placed my hat at the side of his, which means ‘... close to his on a horizontal plane’; there is a sense, that is, in which ‘x is at the side of y’ excludes the possibility of vertical proximity. A third and still more specific sense is observed in His car was at the side of mine, where at the side of is in contrast to in front of and behind, and therefore limits proximity to the secondary horizontal axis. (Leech 1974, p. 169)

If we follow this view, beside does have a meaning that is p-convex, namely a meaning of spatial proximity (like near). We propose that that is the basic meaning of beside and that the more restricted uses result from the pragmatic competition with above/below (excluding vertical proximity) and in front of/behind (excluding proximity to those sides). It has the same meaning as near, but it forms a lexical field with projective prepositions like above and behind, which restrict its meaning in contextually dependent ways. In other words, there is an element of negation in beside: ‘near, but not above, behind, etc.’, somewhat similar to an implicature. If this is true, then it teaches us an important lesson about convexity in general. Cases of non-convexity might actually result from the pragmatic interaction between a convex category and another category that partially overlaps it, creating an ‘indentation’ that causes non-convexity.

When we turn to complex prepositions, run-of-the-mill cases satisfy convexity (given our assumptions about landmarks):

  1. (15)

    a. An exit sign is located 1 feet above the exit.

    b. The hotel is too far outside the city.

Only artificial cases such as (16a) and (16b) will violate p-convexity:

  1. (16)

    a. Click a position diagonally above the circle.

    b. The current level is thus exactly one or exactly two meters above the official level.

In other words, not only basic prepositions are p-convex, but even many of the complex ones. This means that p-convexity gains additional support as a general semantic constraint for locative prepositions, as well as the underlying use of polar coordinates. Of course, the principle of p-convexity should also be analyzed cross-linguistically, against a much wider range of languages and locative adpositions, but that is beyond the scope of this paper.

6 Directional Prepositions with Polar Coordinates

6.1 Directional Prepositions and Paths

We now turn from locative prepositions to directional prepositions, which are used to describe how a trajector is moving with respect to a landmark:

  1. (17)

    a. The fox went into the forest.

    b. The dog came off the table.

    c. The fox jumped through the hedge.

The previous section introduced the formalism for representing the location of a trajector in terms of one vector. If a trajector is moving or if it is extended in shape, then we need the notion of a path, i.e. a directed curve (see for instance Jackendoff 1983; Talmy 2000; Eschenbach et al. 2000; Zwarts 2005; Kracht 2008, and many others). There are different ways to represent a path, but we will represent it here as a function from the real interval [0,1] to S, continuous in the standard calculus sense. The values of the interval [0,1] do not represent moments of time, but they are an ordering mechanism. We also assume that a path has a constant ‘velocity’, i.e. that its first derivative is constant (see Zwarts 2005; Kracht 2008, for more details). Note that instead of [0,1] we could also have taken a discrete domain for the function, because a path might not always need or allow for a continuous curve, but may be just a finite sequence of positions, or, minimally, only two positions, a starting point and an endpoint. What is important is that the path represents ‘locations in sequence’ so to say. In the sequel we also make the assumption that the path is simple, that is, it does not cross itself. This can be defined by saying that for all \(i, j \in [0,1]\), such that \(i\ne j, p(i)\ne p(j)\).

We can refer to the starting point of a path p as p(0), the end point is p(1) and for any \(i\in (0,1)\), p(i) is an intermediate point. All the points of a path are represented in terms of polar coordinates taken from S. It will be convenient later on to refer to these coordinates in the following ways:

  1. (18)

    If \(p(i)=\langle x,{\uptheta },\upvarphi \rangle \), then

    radius\((p(i)) = x\), azimuth\((p(i)) = {\uptheta }\), and polar\((p(i)) = \upvarphi \).

In Fig. 15 a path p with respect to a landmark L is shown with \(p(0) = a\) and \(p(1) = b\), with radius\((p(0)) > \hbox {r}_{L}\) and radius\((p(1)) < \hbox {r}_{L}\). If the situation holds in the xy-plane, then for every \(i\in [0,1]\), polar\((p(i)) = 90^{\circ }\).

Fig. 15
figure 15

A path into L

6.2 Defining the Meaning of Directional Prepositions

Among the directional prepositions one usually distinguishes the following three types (e.g. Jackendoff 1983; Kracht 2002; Pantcheva 2010):

  1. (19)

    goal prepositions: to, into, onto, towards

    source prepositions: from, out of, off, away from

    route prepositions: through, over, along, around, across

In addition, it is possible to get directional readings for locative prepositions:

  1. (20)

    goal: (to go) below, behind, ... something

    source: (to come) from below, from behind, ... something

    route: (to pass) below, behind, ... something

The source needs to be marked by the source preposition from, while the possibility to get a goal or route interpretation depends very much on the verb and other factors (see for example Gehrke 2008; Nikitina 2008).

Most of the directional prepositions can be represented as imposing a locative condition on a particular part of the path, for instance on the end point (goal) or starting point (source). Stricter and weaker definitions are possible, as discussed in Zwarts (2005) and other works, but here we will use definitions that only involve opposite conditions on the starting point and end point. Suppose that P(S(L)) represents the set of paths in the space S(L) of a landmark L. (21) gives examples of definitions of goal prepositions, as functions from landmarks to sets of paths, and (22) does the same for source prepositions.Footnote 10

  1. (21)

    Goal prepositions

    \(\hbox {to}(L) = \{ p \in P(S(L)): p(1) \in \hbox {near}(L)\hbox { and }p(0) \in \hbox {far}(L) \}\)

    \(\hbox {into}(L) = \{ p \in P(S(L)): p(1) \in \hbox { inside}(L)\hbox { and }p(0) \in \hbox {outside}(L)\}\)

    \(\hbox {to-behind}(L) = \{ p \in P(S(L)): p(1) \in \hbox { behind}(L)\hbox { and }p(0) \notin \hbox {behind}(L)\}\)

  2. (22)

    Source prepositions

    \(\hbox {from}(L) = \{ p \in P(S(L)): p(0) \in \hbox { near}(L)\hbox { and }p(1) \in \hbox { far}(L)\}\)

    \(\hbox {out-of}(L) = \{ p \in P(S(L)): p(0) \in \hbox {inside}(L)\hbox { and }p(1) \in \hbox { outside}(L)\}\)

    \(\hbox {from-behind}(L) = \{ p \in P(S(L)): p(0) \in \hbox { behind}(L)\hbox { and }p(1) \notin \hbox { behind}(L)\}\)

Notice that goal and source prepositions differ in whether they apply the locative condition to the end point or the starting point, respectively. A way to handle certain route prepositions is to view them as sequences of two conjoined paths:Footnote 11 by or past a landmark means that the first path goes to the landmark and the second path goes away from the landmark; through a landmark means that the first path goes into the landmark and the second path goes out of from the landmark; and going behind a landmark means that the first path goes to behind the landmark and the second path goes from behind the landmark.

  1. (23)

    Route prepositions

    \(\hbox {by}(L) = \hbox {past}(L) =\{ p \in P(S(L))\): there is a path \(q \in \) to(L) and a path \(r \in \) from(L) and \(p=q{^{^{\wedge }}}r \}\)

    \(\hbox {through}(L) = \{ p \in P(S(L))\): there is a path \(q \in \) into(L) and a path \(r \in \hbox { out-of}(L)\) and \(p=q{^{^{\wedge }}}r \}\)

    \(\hbox {via-behind}(L) = \{ p \in P(S(L))\): there is a path \(q \in \hbox {to-behind}(L)\) and a path \(r \in \hbox {from-behind}(L)\) and \(p=q{^{^{\wedge }}}r \}\)

The prepositions towards and away from involve a comparison of the radius (the distance to the landmark) at the beginning and end of the path:

  1. (24)

    \(\hbox {towards}(L) = \{ p \in P(S(L))\): \(\hbox { radius}(p(1)) <\hbox { radius}(p(0))\) and for all \(i \in [0,1] \hbox { radius}(p(i))>\hbox {r}_{L}\}\)

    \(\hbox {away-from}(L) = \{ p \in P(S(L))\): \(\hbox { radius}(p(0)) < \hbox {radius}(p(1))\) and for all \(i \in [0,1]\hbox { radius}(p(i)) > \hbox {r}_{L}\}\)

We now come to a class of directional prepositions that are at a level of greater complexity than the others, namely around, across, and along. What makes them different from the other route prepositions is that they are not specified in terms of a location at a particular point of the path, but in terms of the shape or orientation of the path as a whole. Around paths are round in some sense, across paths are orthogonal to the main axis of the landmark, along paths parallel (see, for instance Landau and Jackendoff 1993 for more discussion).

We have to set aside across and along, because, as we already mentioned, they involve landmarks that are elongated, which goes beyond our simple model of the landmark as a region around an origo. This leaves us with around, which is a preposition with a wide range of meanings (Zwarts 2003). A very strict interpretation of around (restricted to one plane, with \(\upvarphi =90^{\circ }\)) takes it as corresponding to a full and perfect circular path:

  1. (25)

    \(\hbox {around}_\mathrm{circle}(L) = \{ p \in P(S(L))\): there is an \(x > \hbox {r}_{L}\) and an \(\upalpha \in [0^{\circ },360^{\circ })\), such that for all \(i\in \) [0,1] \(p(i)=\langle x,(\upalpha +i^{.}360^{\circ }),90^{\circ }\rangle \) or for all \(i \in [0,1]\, p(i)=\langle x,(\upalpha -i^{.}360^{\circ }),90^{\circ }\rangle \}\)

x corresponds to the radius of the circular path and \(\upalpha \) to the value of the azimuth angle at which the path starts. The disjunction allows the path to go clockwise and counterclockwise. We can extend this meaning by allowing smaller parts of the circumference to be covered, using a factor k. If \(k = 0.5\), then the path corresponds to a semicircle, if \(k = 0.25\) to a quarter circle. The value of k should be sufficiently large for the path to be circular.

  1. (26)

    \(\hbox {around}_\mathrm{partcircle}(L) = \{ p \in P(S(L))\): there is an \(x > \hbox {r}_{L}\), an \(\upalpha \in [0^{\circ },360^{\circ })\), and a sufficiently large \(k \in (0,1)\) such that for all \(i\in [0,1] \,p(i)=\langle x,\upalpha +k^{.}i^{.}360^{\circ },90^{\circ }\rangle \) or for all \(i\in [0,1]\, p(i)=\langle x,\upalpha -k^{.}i^{.}360^{\circ }, 90^{\circ }\rangle \}\)

Another sense of around is a path that encloses the landmark completely on all sides (in the xy plane), as defined in (27).

  1. (27)

    \(\hbox {around}_\mathrm{enclosure}(L) = \{ p \in P(S(L))\): for every \({\uptheta }\in [0^{\circ },360^{\circ })\) there is an \(i \in [0,1]\) and an \(x >\hbox {r}_{L}\) such that \(p(i)=\langle x,{\uptheta },90^{\circ }\rangle \}\)

This definition includes paths that spiral around the landmark, for instance. There are more possibilities (see Zwarts 2003), but this gives an idea of the variety that some route prepositions exhibit.

6.3 Convexity of Directional Prepositions

If we focus on simple paths that are defined as mappings from the interval [0,1] to points \(\langle x,{\uptheta }\rangle \) in one plane (omitting the polar coordinate \(\upvarphi \)), the following general definition adopts the polar geometry for paths:Footnote 12

  1. (28)

    Let \(p_{a}(i)\) and \(p_{c}(i)\), where \(i \in \) [0,1], be two paths mapping onto \(\langle x_{a}(i),{\uptheta }_{a}(i)\rangle \) and \(\langle x_{c}(i),{\uptheta }_{c}(i)\rangle \) respectively. Then the path \(p_{b}(i)\) is said to be polarly between (p-between) \(p_{a}(i)\) and \(p_{c}(i)\), if and only if, there is some \(k, 0 < k < 1\), such that for all \(i \in \) [0,1], \(p_{b}(i)=\langle { kx}_{a}(i) + (1-k)x_{c}(i),k{\uptheta }_{a}(i) + (1-k){\uptheta }_{c}(i)\rangle \) iff \({\vert }{\uptheta }_{a}(i)-{\uptheta }_{c}(i){\vert } < 180^{\circ }\) and \(p_{b}(i) =\langle { kx}_{a}(i)+(1-k)x_{c}(i),k{\uptheta }_{a}(i) + (1-k)({\uptheta }_{c}(i)-360^{\circ })\rangle \) iff \({\vert }{\uptheta }_{a}(i)-{\uptheta }_{c}(i){\vert }>180^{\circ }\).Footnote 13

Given this definition it is easy to prove that the meanings of the goal prepositions above are polarly convex. Convexity for into follows from convexity of inside together with the convexity of outside. More precisely, if \(p_{1}\) and \(p_{2}\) are two paths such that \(p_{1}(1)\in \hbox {inside}(L)\) and \(p_{2}(1)\in \hbox {inside}(L)\) for any L, then for any path \(p_{3}\) between \(p_{1}\) and \(p_{2}\), it will also hold that \(p_{3}(1) \in \hbox {inside}(L)\); and if \(p_{1}(0)\in \hbox {outside}(L)\) and \(p_{2}(0) \in \hbox {outside}(L)\) for any L, then for any path \(p_{3}\) between \(p_{1}\) and \(p_{2}\), it will also hold that \(p_{3}(0) \in \hbox {outside}(L)\). A parallel proof can be given for out of.

Similarly, the convexity of to follows from the fact that the regions for near and far are convex. To be precise, if \(p_{1}\) and \(p_{2}\) are two paths such that \(p_{1}(1)\in \hbox {near}(L)\) and \(p_{2}(1) \in \hbox {near}(L)\) for any L, then for any path \(p_{3}\) between \(p_{1}\) and \(p_{2}\), it will also hold that \(p_{3}(1) \in \hbox {near}(L)\); and if \(p_{1}(0)\in \hbox {far}(L)\) and \(p_{2}(0) \in \hbox {far}(L)\) for any L, then for any path \(p_{3}\) between \(p_{1}\) and \(p_{2}\), it will also hold that \(p_{3}(0) \in \hbox {far}(L)\). See Fig. 16 for a graphical illustration. An analogous argument holds for from, and similar arguments can be made for (to) behind and from behind, since the regions for behind and its complement outside the landmark are convex.

Fig. 16
figure 16

Path \(p_{3}\) is p-between \(p_{1}\) and \(p_{2}\)

The polar convexity of towards(L) is easy to prove: if \(p_{1}(i)\) and \(p_{2}(i)\) are two paths such that radius(\(p_{1}(1)) < \hbox {radius}(p_{1}(0))\) and for all \(i \in \) [0,1] radius\((p_{1}(i))> \hbox {r}_{L}\), and radius\((p_{2}(1))<\hbox {radius}(p_{2}(0))\) and for all \(i \in \) [0,1] radius\((\hbox {p}_{2}(i))> \hbox {r}_{L}\), then for any path \(p_{3}\) p-between \(p_{1}\) and \(p_{2}\), it will also hold that radius\((p_{3}(1)) < \hbox {radius}(p_{3}(0))\) and for all \(i \in \) [0,1] radius\((p_{3}(i)) > \hbox {r}_{L}\). An analogous argument shows that away from is convex.

Given the definitions in (23), the convexity of the route prepositions follows essentially from the convexity of the set of points that conjoin the two paths, in analogy with the convexity for the goal prepositions. A point conjoining two paths p and q is a point for which \(p(1) = q(0)\). It is interesting to note that with our definition of betweenness for paths, conjoining paths preserves convexity: If P and Q are convex sets of paths and \(P{^{^{\wedge }}\!}Q\) is the set of conjoinings of paths from P and Q, then \(P{^{^{\wedge }}\!}Q\) is convex too.Footnote 14

We finally turn to the polar convexity of the different versions of \({ around}, { around}_\mathrm{circle}\), \({ around}_\mathrm{partcircle}\), \({ around}_\mathrm{enclosure}\), for which such a conjoining approach of more basic paths is not an option. On the basis of definition (28), the corresponding sets of paths are not necessarily polarly convex. To see this, consider two \(\hbox {around}_\mathrm{partcircle}\) paths \(p_{1}\) and \(p_{2}\) that have the same constant radius r and the same starting angle \(\upalpha =0^{\circ }\), but that go in opposite directions (clockwise versus counterclockwise). There is a path \(p_{3}\) p-between \(p_{1}\) and \(p_{2}\) that also has a radius r but that has an angle \({\uptheta }\) that is always halfway between the \({\uptheta }\) angles of \(p_{1}\) and \(p_{2}\), that is, the angle \(0^{\circ }\). However, this path is not an \(\hbox {around}_\mathrm{partcircle}\) path. In fact, it is not even a ‘proper’ path because it maps only to a single point. See Fig. 17.

Fig. 17
figure 17

Non-p-convexity of around paths

The source of the problem is that if two paths go in opposite angular directions then their angles can even each other out in the definition of polar betweenness. A way out of this problem requires a closer look at the directionality of prepositions. What makes route prepositions like around different from goal and source prepositions is that they are ‘symmetric’. If a path p is in the set \(\hbox {around}_\mathrm{partcircle}(L)\) for a landmark L, then so is the path \({\sim } p\) that is the exact reversal of p, i.e., for every \(i \in \) [0,1], \({\sim } p(i)=p(1-i)\). In a sense, within this symmetrical set, p and \({\sim } p\) are non-distinct paths, because around does not care about the direction of a path. We can say that the paths \(p_{1}\) and \(p_{2}\) have the same directionality if either azimuth(\(p_{1}(0)) < \hbox {azimuth}(p_{1}(1))\) and \(\hbox {azimuth}(p_{2}(0)) < \hbox {azimuth}(p_{2}(1))\) or \(\hbox {azimuth}(p_{1}(0)) > \hbox {azimuth}(p_{1}(1))\) and \(\hbox {azimuth}(p_{2}(0)) > \hbox {azimuth}(p_{2}(1))\). (This definition works for all simple paths, as long as none of the paths cross the line \({\uptheta }= 0^{\circ }\). In that case, a more complicated definition will become necessary.) As a more precise version of (28), we can then include directionality in the definition of p-convexity of a set of paths P:

  1. (29)

    A set of paths P is directionally p-convex if and only if, for every path \(p_\mathrm{a}\) and \(p_\mathrm{c}\) in P that have the same directionality, and for any path \(p_{b}\) that has the same directionality and is p-between \(p_\mathrm{a}\) and \(p_\mathrm{c}\), \(p_{b}\) is in P.

With this definition, \(\hbox {around}_\mathrm{partcirc}\) becomes directionally p-convex, because a path between two others, and with the same direction, will also be a path satisfying the conditions of \(\hbox {around}_\mathrm{partcirc}\).

7 Conclusion

The main topic of this paper has been the use of geometric notions to describe the semantics of locative and directional prepositions. By using polar coordinates instead of Cartesian coordinates, we have been able to present a unified analysis of most of these prepositions. Locative prepositions denote regions in space and directional prepositions denote simple conditions on paths in space defined in terms of distance (radius) and direction (angle). The use of polar coordinates opens up new computational methods to calculate the meanings of prepositional expressions and new ways to analyze linguistic phenomena within and across languages.

Locative and directional prepositions are prime examples of how meaning is geometrically structured (Gärdenfors 2014). We have been able to show that locative and directional prepositions support the general idea that concepts can be represented by convex regions in conceptual spaces (Gärdenfors 2000), if convexity is defined in terms of a new notion of betweenness over polar coordinates. Polar convexity generalizes and unifies the earlier constraints of continuity in Zwarts (1997). Several prepositions that seem to present counterexamples to polar convexity turn out to have special features. The preposition beside is non-convex because of the role of negation, due to competing items, and around required special attention because of its symmetric nature.

Our exploration was based on strong idealizations about the nature and shape of the landmark (as one single, spherical, bounded object referred to by a definite noun phrase). One question for further research is to what extent certain idealizations or schematizations of the shape of a landmark are part of parcel of the way spatial language is used and spatial cognition functions (see Herskovits 1998) and what role convexity plays in those schematizations. Another question is how our approach to convexity extends to landmarks that are plural or that are referred to by indefinite noun phrase. As Mador-Haim and Winter (2015) show, there is an intricate interaction between the properties of the landmark and of the preposition and that interaction also involves convexity.

We set aside some prominent prepositions that cannot be analyzed in purely spatial terms, such as in, on, over, under and against. Some authors (e.g. Coventry et al. 2001; Coventry and Garrod 2004) claim that the semantics of these prepositions have both force dynamic and spatial components. We believe, however, that the spatial component can be derived (metonymically) from the space of forces (see Gärdenfors 2014, Section 11.6–11.8). After all, forces are spatially located. This would also help to explain why some of the force-dynamic prepositions (at, in, on) are the most basic and versatile prepositions used for locating objects and even why they appear as parts of complex prepositions (at the left, in front, on top). It would take us too far, however, to develop this point here.

Since the force fields involved depend to a large extent on the nature of the trajector and the landmark and their relation, it is difficult to determine to what extent the meaning of these prepositions represent convex regions in the force domain or some product space (see Gärdenfors 2014, Sections 11.6–11.8 for a discussion of this). This topic merits further investigation.