1 Introduction

Since the work of Chomsky (2000, 2001), a common perspective in Minimalist syntax has been that Internal Merge requires the establishment of an Agree relation. Expanding on this view, there is a growing body of literature including Chomsky (2004, 2007, 2008), Obata (2010), Obata and Epstein (2011), van Urk (2015), Keine (2016, 2019), and Lohninger et al. (2022), which focuses on deriving the distinction between A- and \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-movement from the Agree mechanism. In particular, it has been suggested that A-movement and \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-movement behave differently because they are driven by different kinds of features, a concept implemented in various ways by the aforementioned authors. Despite the differences among specific proposals, a common thread across these previous studies is that the relevant constraints on movement are either fully or partially removed from movement dependencies themselves, and the burden of explanation is instead placed on the Agree mechanism and Merge. Concomitantly, there has also been increasing discussion about whether an Agree-based view of the A/\(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-distinction handles well-known generalizations and puzzles in a more explanatory way, and whether all properties of movement can be reduced to Agree and features.

Against this backdrop, this paper takes as its point of departure novel observations on the interaction between A/\(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-operations at the Khalkha Mongolian (henceforth Mongolian) embedded clausal periphery. The main proposal is that an Agree-based view of movement types, combined with the Mongolian clausal architecture, allow us to provide a unified account for the patterns of exceptional case marking (ECM), wh-licensing, and topicalization in Mongolian. Specifically, the current account explains three interconnected properties of Mongolian syntax:

  1. (i)

    Mongolian allows acc-case assignment on an embedded subject at the edge of an embedded finite CP.

  2. (ii)

    Only acc-subjects can be A-extracted from an embedded finite CP; nom subjects may not be.

  3. (iii)

    \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-operations (wh-licensing and thematic topicalization) on subjects within the embedded clause are incompatible with the contexts specified in (i-ii).

To help illustrate these properties, consider the sentences presented in example (1).

figure a

First, (1a) is an embedded declarative construction, in which the matrix verb khel- (‘to say’) takes a finite embedded CP headed by the complementizer gej. Property (i) is illustrated by the fact that the embedded subject Bat can appear in either nom or acc form. With respect to their differences, previous research suggests that nom subjects are structurally lower than acc subjects. The acc-case marking is due to the subject raising to the edge of CP, receiving its case from the matrix clause in an ECM-like fashion (Fong 2019, use of the term ECM mine). The nom and acc embedded subjects in (1a) also differ in their movement properties. As will be discussed in the subsequent sections, only acc subjects can A-move into the matrix clause, and nom subjects generally cannot. In the second example (1b), the embedded clause is also a finite CP headed by [\({}_{\textsc {c}}\) gej]. What is different from (1a) is that in (1b) the clausal complement is a wh-question in which the subject wh-phrase takes embedded scope. Importantly, in this example ECM on the embedded subject is degraded. Note that the degraded status of acc on the subject is not due to the [+Q] CP blocking cross-clausal acc case assignment (cf. Kitagawa 1985). The embedded wh-question in (1c) has the object wh-phrase taking embedded scope. The embedded subject is a non-wh, regular DP Zaya. The complement clauses in (1b) and (1c) presumably have the same size. The acc case marking is degraded on the wh-subject in (1b), but not on the regular non-wh-subject Zaya in (1c).

As I will show, ECM is not only degraded on wh-subjects of embedded interrogatives as seen in (1), but is also degraded when an embedded subject undergoes embedded thematic topicalization, both being \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-operations. Building on an Agree-based view of phrasal movement, I argue that (i-iii) directly results from the Mongolian clausal architecture and the availability of a \(\phi \)-probe on \(\hbox {C}^{0}\). While it has been argued that Mongolian Spec CP can be an A-position (Fong 2019), I show that there exists an intermediate \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-domain between TP and CP, resulting in a \([[[ \hspace{.1in} ... \hspace{.1in} _{\textsc {a}}] \,{}_{\hspace{1.27777pt}\overline{\hspace{-1.27777pt}{{\textbf {A}}}\hspace{-1.27777pt}}\hspace{1.27777pt}}] \,{}_{{{\textbf {A}}}}]\) clausal periphery. This departs from a typical characterization of A/\(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-domains, in which different domains are structurally determined in terms of height. In particular, \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-movement has been taken to target higher landing sites than A-movement (e.g., Williams 2003; Müller 2014, also see Keine 2016, 2018 for an overview of alternative approaches to characterizing A/\(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-positions).Footnote 1 The proposal for Mongolian, in which Spec CP as an A-position is potentially available above some \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-domains, does not fit well with accounts in which A- and \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-domains are directly mapped onto the hierarchy of levels of projections. Instead, the current proposal, if on the right track, suggests that there should not be a universal classification of A/\(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-positions, nor should there be a universal hierarchy between them (e.g., that \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-positions are uniformly higher than A-positions), a natural consequence under the Agree-based approach to movement types. Since properties of movement make reference to the Agree mechanism, what makes one movement type differ from another is really the structural locations and properties of the probes and goals, subject to cross-linguistic variation. This is a desirable outcome. As discussed in much recent literature, the A- vs. \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-distinction no longer has an independent status in the Minimalist Program. The Agree-based approach, among other proposals, is one of the ways to render such distinction an epiphenomenon that emerges from the operations Agree and Merge (see e.g., Chomsky 1995, 2004; for recent discussions and alternative proposals see Safir 2019).

Due to its flexible word order and unique locality profile, Mongolian provides an ideal testing ground for the Agree-based approach to movement types. The case study presented here illuminates the behavior of the Agree mechanism as it relates to movement dependencies in that it examines the features which are operative in triggering movement, and the interactions between probes bearing different features. Against this general background, the rest of the paper will proceed as follows. In Section 2, I provide background on the Mongolian ECM constructions, specifically addressing the previous proposal that Spec CP in these ECM configurations can be an A-position. In Section 3, I present a set of data which shows that ECM is degraded when the embedded subject is a wh-phrase taking embedded scope. In Section 4, I present an additional set of data showing that ECM is also degraded when the embedded subject is the embedded thematic topic. In Section 5, I build on the observations in Sections 3 and 4 and present an Agree-based analysis for the interactions between ECM and \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-operations (wh-licensing, thematic topicalization). Section 6 discusses further implications of the current proposal. Section 7 concludes the paper.

2 Mongolian ECM and the Potential A-Status of Spec CP

The main purpose of this section is to provide the necessary background for the discussion of the interaction between ECM and \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-operations in Sections 3 and 4. I focus in particular on the previous treatment of Mongolian ECM, and the proposal that Spec CP can be an A-position in Mongolian.

2.1 Some Background on Mongolian ECM Constructions

Mongolian exhibits nom-acc case alternation on the embedded subject shown in (2) below.

figure b

The construction in (2b) differs from typical English ECM constructions (e.g., I believe him to be smart) in several ways. First, the acc case marking on the embedded subject is optional. In most cases, the subject of a finite embedded clause can alternate between acc (-iig in (2b)) and nom (morphologically unmarked on regular DPs, marked on pronouns). Second, the embedded clause which allows an acc subject in (2) is a full finite CP with a complementizer gej. In these two respects, Mongolian ECM constructions resemble those in Japanese (e.g., Kuno 1976, 2007; Hiraiwa 2001; Tanaka 2002a) and Korean (Yoon 1991; Hong 2005; Yoon 2007), although language-specific differences remain.

Fong (2019) argues that Mongolian ECM, as exemplified in (2b), does not involve prolepsis in which the acc subject is base-generated in the matrix clause. Instead, she demonstrates that the acc subject in (2) originates from the embedded clause, and subsequently raises to the edge of the embedded CP where it receives acc case from the matrix clause. I will summarize some of her core arguments here. First, the acc subject can surface to the right of an unambiguously embedded adverb (cf. e.g., Deal ’s 2017 discussion on Nez Perce, where a proleptic acc-DP cannot surface below an embedded adverb). The data in (3) suggest that the acc-subject originates from the embedded clause and that it need not obligatorily raise into the matrix clause.

figure c

However, Mongolian has a highly flexible word order. Thus, an alternative analysis of the word order in (3) is that both the temporal adverb margaaš ‘tomorrow’ and Dulmaa-g ‘Dulmaa-acc’ are located in the matrix clause. This alternative account would be possible if the adverb could be placed inside the matrix clause by scrambling.

figure d

To rule out the possible derivation in (4), Fong shows that independently scrambling the adverb margaaš out of the embedded CP results in the secntence being severely degraded.

figure e

Fong attributes the status of (5) to a general ban on long-distance scrambling (LDS) in Mongolian. However, as will be discussed below, Mongolian does allow LDS of embedded arguments (e.g., Sakamoto 2012, 2017; Gong 2023). The reason behind the degraded status of (5) is worth exploring further in future research.Footnote 2 Nevertheless, for the purpose of the current point, it suffices to confirm that LDS of the adverb ‘tomorrow’ is independently unavailable in (5), and thus the structural analysis (4) relying on scrambling of ‘tomorrow’ is ruled out. Therefore, the fact that the acc-subject surfaces to the right of the embedded temporal adverb can be taken to indicate that the acc-subject is located within the embedded CP in (3).

In addition, the entire embedded CP containing the acc-subject can be fronted as a unit. This is expected if the acc-subject is located within the embedded clause in (6).

figure h

Fong presents another set of data concerning idiomatic interpretation which supports the idea that the acc-subject originates from the embedded clause.

figure i

In (7), the acc-subject Bat-iin nüd-iig ‘Bat’s eye’-acc can be construed as a part of the subject-verb idiom. This is expected if the acc-subject is introduced inside the embedded clause.Footnote 3

The above facts point to two conclusions. First, the embedded acc-subject originates from the embedded clause. Second, the embedded acc-subject does not obligatorily raise into the matrix clause. Building on these conclusions and considering additional facts from binding (see Fong 2019: 10-13), Fong argues that while acc subjects do not obligatorily raise into the matrix clause, they are nevertheless higher than regular nom subjects in the syntactic structure. Specifically, she proposes that the subject raises to the edge of embedded CP, where it receives acc case from the matrix v. This process is schematized in (8).

figure j

As will be discussed in detail in Section 5, I will adopt Fong’s idea that the embedded subject can raise to the edge of the embedded CP and receive acc case there. However, I will depart from Fong’s analysis in not taking the matrix v to be the source of acc case assignment. Rather, I adopt the idea that acc case in Mongolian is assigned as a dependent case. In the next section, I will present additional properties of Mongolian ECM, and return to the details of the dependent case analysis in Section 5.

2.2 Spec CP as an A-position

In the preceding section, I discussed some facts of ECM constructions in Mongolian, and showed that they can be accounted for under Fong’s (2019) analysis in which the embedded subject undergoes raising to Spec CP and receives acc case there. Another important observation Fong (2019) makes is that the accusative embedded subjects can A-move out of the embedded CP and into the matrix clause. First, while in most cases the embedded subject can either be in nom or acc, nom subjects in general may not move out of the embedded clause, only acc subjects can.

figure k

Footnote 4

Fong (2019) demonstrates that the cross-clausal movement of acc subjects exemplified in (9b) displays A-properties. For example, (10) shows that cross-clausal movement of an embedded accusative subject feeds variable binding in the matrix clause.

figure l

Based on evidence such as (10), Fong concludes that the cross-clausal displacement of the accusative subject observed in (9b) is a kind of A-movement.

figure m

It is well-known that A-movement out of a finite clause is not possible in English (Chomsky 1973)—a sentence like (12) is ungrammatical.

figure n

The standard account (e.g., Chomsky 1977, 1981, 1986; May 1979; Lasnik and Saito 1992; Fukui 1993a) developed to exclude constructions like (12) consists of two major components. First, the requirement, due to subjacency, that movement proceed through the edge of a finite clause, which is an \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-position in English; Second, a constraint imposing a Ban on Improper Movement which prohibits movement from an \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-position to an A-position.Footnote 5

figure o

In contrast to English, subsequent research suggests that some languages do allow A-movement to proceed out of a finite clause, known as hyperraisingFootnote 6 (e.g., Ura 1994, 1995; Ferreira 2004; Carstens and Diercks 2013; Halpert 2019; Zyman 2018; see Zyman 2021 for a recent review of languages with hyperraising and major analytical approaches). Mongolian, as discussed above, is one such language. According to Fong’s (2019) analysis, the embedded Spec CP in Mongolian is an A-position, and thus movement to Spec CP is an instance of A-movement.

figure p

Therefore, Mongolian has “proper” movement through Spec CP, an idea which was previously proposed for Japanese (e.g., Tanaka 2002a) and Korean (e.g., Yoon 1991). On Fong’s account, A-movement through Spec CP in Mongolian is not deemed a violation of (13) due to the presence of a \(\phi \)-probe on \(\hbox {C}^0\). In particular, she extends van Urk ’s (2015) proposal that A-movement is distinguished from \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-movement in that the former involves \(\phi \)-feature agreement, whereas the latter does not (also see Chomsky 2007 for similar ideas). Essentially, Spec CP is regarded as an A-position in Mongolian because movement targeting Spec CP takes place in response to \(\phi \)-agreement. However, as Zyman (2021) comments, since modern Khalkha Mongolian does not seem to exhibit overt \(\phi \)-agreement, a natural question is whether there is any direct evidence for the existence of such \(\phi \)-probe in Mongolian, an issue which remains unresolved in Fong (2019). In the upcoming sections, I identify and confirm several predictions arising from the availability of a probe on \(\hbox {C}^{0}\) which induces A-movement. Specifically, I show that the interactions between A- and \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-operations at the embedded finite clausal periphery naturally follow from an Agree-based view of movement types, in which A-movement to Spec CP proceeds in response to a \(\phi \)-probe on \(\hbox {C}^{0}\), combined with the Mongolian clausal architecture.

3 ECM is incompatible with wh-licensed subjects

In this section, I present the interaction between ECM and wh-licensing in Mongolian. In Section 3.1, I provide background on Mongolian wh-questions. I adopt the assumption that wh-phrases do not move covertly, and are instead licensed in-situ. Given this background, I present previously unnoticed data in Section 3.2, suggesting that in an embedded question, wh-licensing and ECM cannot co-occur on the embedded subject. I identify some of the issues these data pose for the A-status of Spec CP, which I will provide a solution for in Section 5.

3.1 Wh-Licensing in Mongolian

This subsection provides background on Mongolian wh-questions in preparation for the discussion in Section 3.2, focusing specifically on the structure of embedded questions. I assume wh-licensing in Mongolian does not involve covert or overt movement of wh-phrases, and present empirical data consistent with this view.

3.1.1 Matrix and Embedded Questions

Mongolian is a wh-in-situ language. A grammatical wh-question requires the presence of a wh-Q particle be/we, which is distinguished from a yes/no question particle uu/üü.

figure q

Footnote 7

In embedded constructions, the location of a wh-Q particle unambiguously indicates the scope of the wh-phrase. As shown in (16a), the wh-phrase ali nomig (‘which book.acc’) inside the embedded clause obligatorily takes embedded scope when the question particle be is inside the embedded clause. In contrast, the wh-phrase obligatorily takes matrix scope when the question particle be is in the matrix clause, as shown in (16b).

figure r

Notice also that as exemplified in (16a), the wh-question particle is always followed by the complementizer gej in Mongolian finite embedded questions.Footnote 8 I propose that the wh-question particle be/we instantiates a \(\hbox {Force}^{0}\) head, which hosts a wh probe. The complementizer gej in an embedded question instantiates \(\hbox {C}^{0}\) located above \(\hbox {Force}^{0}\). Therefore, a wh-question with embedded scope such as the one in (16a) has the following clausal periphery.

figure s

3.1.2 Wh-licensing in Mongolian

In this section, I discuss the mechanism of wh-licensing in Mongolian and take note of some auxiliary assumptions. It should be acknowledged that whether in-situ wh-phrases undergo covert movement or not in Mongolian is a complex issue beyond the scope of the current paper. Therefore, I will limit my discussion to the points immediately relevant to the current topic, and leave a thorough investigation into this issue to future work. In this paper, I take the position that Mongolian wh-questions involve wh-licensing, which does not require overt or covert movement but is instead established via Agree with \(\hbox {Force}^{0}\). I discuss data related to subjacency and focus intervention effects which lead me to adopt the no-covert-movement view.

First, while Fong (2019) suggests that the unacceptability of a wh-phrase inside conditional clauses and whether-islands can be accounted for by assuming covert wh-movement in Mongolian, it has long been observed, since Huang (1982), that some in situ wh-phrases can indeed appear in positions from which overt extraction is not quite acceptable. As discussed in detail by Simpson (2000), the fact that overt and covert movement often are not fully parallel in their locality profile poses challenges to the type of approach which motivates covert wh-movement based on island sensitivity. This non-parallelism with regard to movement carries over to Mongolian, in which relative clauses (RCs) are strong islands for scrambling (18a), and overt \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-movement, such as relativization (18b).

figure t

Nevertheless, a wh-phrase may freely appear in RCs (19).Footnote 9

figure u

Similarly, a wh-phrase can also take scope out of an adjunct clause, another strong island for overt phrasal movement such as scrambling.

figure v

The second piece of relevant data concerns focus intervention effects, a phenomenon which has been suggested to have implications for the presence or absence of covert wh-movement in a certain language (e.g., Pesetsky 2000; Keine 2016; Aravind 2018; Kotek 2019, see also Cable 2010). Beck (2006), building on Kim (2002) and expanding earlier work (Beck 1996; Beck and Kim 1997), discusses intervention effects in wh-questions of a number of languages, such as Korean, Japanese, Malayalam, Hindi/Urdu, French, German, and parts of English. The focus intervention configuration can be described as follows.

figure w

An example of focus intervention effect in Mongolian is illustrated in (22). In (22a), the NPI khen č (‘anyone’) c-commands the wh-object yamar nomig (‘what book-acc’), in violation of (21a). The sentence is severely degraded. Importantly, the structural relationship between the wh-phrase and the NPI intervener matters. If the wh-phrase scrambles to a higher position where it is no longer c-commanded by the intervener, the intervention effect disappears. This possibility is presented in (22b).

figure x

Expanding on earlier work, Beck provides a semantic analysis of intervention effects in which wh-phrases make use of the same interpretational mechanism as focus. To give a very general summary, in her account a wh-phrase can only be evaluated by a question operator, and requires its first c-commanding operator to be a question operator. As a result, a wh-phrase c-commanded by an intervening focus-sensitive operator will be uninterpretable even if there exists a higher c-commanding question operator, giving rise to intervention effects.Footnote 10 Beck’s analysis and its implications are further explored in Kotek (19942019). In particular, Kotek examines the correlation between superiority and intervention effects in English (an observation due to Pesetsky 2000), and suggests that covert movement, just as overt movement, is capable of obviating intervention effects. Subsequent studies build on this idea and use focus intervention effects to diagnose the existence of potential covert wh-movement in a wh-in-situ language (e.g., Keine 2016 for Hindi, Aravind 2018 for Malayalam). In particular, it is taken that focus intervention effect arises if neither overt nor covert movement takes place.

If these proposals are on the right track, the data in (22) potentially provide further clues on the absence of covert wh-movement in Mongolian. Specifically, if the wh-phrase in (22a) had undergone covert movement to a higher position, we would not have observed an intervention effect. The fact that such an effect does arise in (22a) suggests that the wh-phrase does not undergo covert movement to a higher position.

In the current analysis, I will assume a mechanism in syntax in which construction of a wh-question does not involve covert movement, and instead involves establishing an Agree relation between a wh-probe (on \(\hbox {Force}^{0}\) in Mongolian) and an appropriate goal. This Agree relation is taken to be able to apply long-distance, as evidenced by the fact that a wh-phrase inside an embedded CP can be licensed by a matrix Q-particle in (16b), repeated below.

figure y

Before proceeding to the main observations regarding wh vs. ECM interaction, I will make some additional comments regarding the long-distance wh-licensing mechanism assumed here. The Agree relationship between the Q-particle and the wh-phrase in (23) crosses a CP phase boundary. A natural question arises as to how this kind of long-distance Agree mechanism fares with locality restrictions on syntactic operations such as the Phase Impenetrability Condition (PIC) (Chomsky 2001). For present purposes, I assume that wh-agreement in Mongolian is subjected to locality restrictions, and that the long-distance Agree is possible here due to a version of cyclic agreement (Legate 2005; Munakata 2007; Aravind 2018). The core idea of cyclic agreement, as laid out in Legate (2005), is that potential inter-phasal agreement relations may consist of smaller local agreement steps established through the intermediary of intervening phase-defining heads. While this mechanism was originally proposed to handle certain unaccusative constructions, Legate notes that the cyclic agreement operation has clear applications in wh-in-situ constructions. A recent implementation of cyclic agreement in \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-constructions is Aravind ’s (2018) account of long-distance wh-licensing in Malayalam. Building on Legate (2005), Aravind proposes that in Malayalam, long-distance wh-agreement that spans across a CP phasal boundary is mediated by the \(\hbox {C}^{0}\) of the intermediate clause. For concreteness, in this paper I will assume a similar long-distance agreement mechanism for Mongolian wh-licensing. Take the configuration in (23) as an example. The agreement between the matrix wh-Q and the embedded wh-phrase is mediated by the \(\hbox {C}^{0}\) of the intermediate CP. Although the intermediate CP is not associated with interrogative interpretation, I take the \(\hbox {C}^{0}\) of this intervening CP to possess a wh-feature which may Agree with ali nomig ‘which book.acc’ in the embedded clause. After the wh-feature of the intermediate \(\hbox {C}^{0}\) is valued via Agree, this intermediate \(\hbox {C}^{0}\) itself may function as a goal for the matrix clause wh-probe.Footnote 11

3.2 Wh-Licensing vs. ECM

The core observation of this section is that ECM is degraded when the embedded subject is a wh-phrase taking embedded scope. First, recall from previous discussion that wh-Q particles in Mongolian unambiguously indicate wh-scope. Therefore, in (24) only the embedded scope reading is available. Notice that in this sentence, the wh-phrase ali nomig (‘which book.acc’) is the embedded object, and acc case marking on the embedded subject Bat is acceptable.

figure z

In contrast, if the wh-phrase is in the subject position of the embedded clause, it can only be in nom case. Acc case marking is degraded.

figure aa

Similar effects are found in embedded multiple wh-questions. In (26), the ECM of the embedded wh-subject is degraded.

figure ab

Strikingly, this constraint disappears when the embedded wh-subject obligatorily takes matrix scope (i.e., no embedded scope reading is possible).

figure ac

The observations made in (24–27) can be descriptively stated as (28).

figure ad

This generalization raises three questions. First, if Mongolian embedded Spec CP is an A-position that a subject can optionally raise to and receive acc, why is this option unavailable when the subject is a wh-phrase taking embedded scope? Second, why does this option become available when the wh-subject obligatorily takes matrix scope? Third, what is the structural source of this pattern, and how do we model it in such a way so that (28) emerges from independent properties of grammar? I will defer a solution to these questions until Section 5. In the next section, I identify parallels between embedded wh-constructions and thematic topicalization in their interactions with ECM, which allows for a unified analysis that accounts for all three phenomena.

4 ECM is incompatible with thematic topic subjects

In this section, I examine further data which show that ECM is incompatible with subjects as embedded thematic topics. I identify the status of thematic (aboutness) topics in Mongolian and posit that they are interpreted at Th(eme)P. I then show that thematic topic interpretations are unavailable on ECM subjects, a pattern which resembles (28).

4.1 Topic Constructions in Mongolian

In Mongolian, topics are often indicated by the particle bol. The interpretation of phrases marked by bol is, to some extent, similar to those marked by wa in Japanese (e.g., Kuno 1975; Heycock 2008; Tomioka 2010; Vermeulen 2013). For example, similar to wa, bol-marked phrases can be interpreted as a “theme” (i.e., aboutness topic or thematic topic as defined in Kuno 1975) or as a contrastive topic; thus, (29) is ambiguous. The sentence can be used to convey information about Bat, without implying any information about other individuals or entities. Alternatively, under the contrastive reading, the sentence implicates contrast with other individuals or entities salient in the context.

figure ae

While a contrastive reading is available for bol-marked phrases in most cases, the thematic reading is possible only when a phrase is in the clause-peripheral position. In (30), bol marks ter nomig (‘that book.acc’) in the canonical object position. The object can only receive contrastive reading. However, if the object is placed at the sentence-initial position as in (31), the thematic reading becomes accessible.

figure af

The thematic reading on a subject topic remains accessible even when the subject no longer counts as the sentence-initial constituent after scrambling applies. As shown in (32), when a bol-marked subject is preceded by a scrambled object, the thematic reading, although not as strong as the contrastive reading, is still accessible.

figure ag

Importantly, thematic topic interpretations are not limited to matrix clauses. In Mongolian, finite CP complements of the verb ‘to say’ allow embedded thematic topics. A typical case is given in (33), in which the third singular pronominal possessive in the embedded CP coindexes with the matrix subject bagš (‘teacher’), ensuring that it is not a direct quotation (cf. Fukui 2006; Vermeulen 2013). The embedded subject Bat is marked by bol, giving rise to an embedded thematic topic reading. Meanwhile, the contrastive reading is also available.

figure ah

As (33) exemplifies, structurally speaking, the kind of embedded environment which can accommodate thematic topic interpretations is limited to CPs that are “big enough” (i.e., finite CPs headed by the complementizer gej). In contrast, nonfinite embedded clauses smaller than a CP cannot host a thematic topic. For example, (34) contains a nonfinite embedded clause which is marked with acc case. The embedded verb unšikh (‘to read’) is a nonfinite participial form with a non-past interpretation.Footnote 12 This embedded clause does not project CP, nor does it allow [\(_{\textsc {c}}\) gej]. The phrase marked by bol only has contrastive reading.Footnote 13

figure ai

Based on the patterns of matrix and embedded topics, I propose that there is a designated projection Th(eme)P for thematic topics in the Mongolian clausal periphery, between TP and CP, adapting similar proposals for Japanese by Saito (20092012) to the current case. A phrase occupying this position receives thematic topic reading.

figure aj

4.2 Embedded Topicalization vs. ECM

Given the background provided above, consider (36-37). In both examples, the embedded subject Natsagdorj is marked by bol. Crucially, when the embedded bol-marked topic receives acc case as in (35), the thematic topic reading is no longer available. Such a reading can only be obtained when the embedded subject is in nom, as in (37).

figure ak
figure al

This pattern resembles the one exhibited by wh-constructions in Section 3.2. Typically, both topicalization and wh-licensing/movement are identified as \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-operations. The parallelism can therefore be stated in terms of interactions between A- and \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-operations. In particular, \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-operations involving wh and thematic topics are incompatible with ECM (an A-operation), when they apply to the same DP at the embedded clausal periphery.

Similar to wh-constructions, thematic topic marking on the embedded object does not interfere with ECM. This point is illustrated with (38), in which the embedded object nooluurin büteegdekhüüniig (‘cashmere products’.acc) is marked with the topic marker bol. The fact that the object cashmere product in (38) is not in its canonical object position is indicated by the fact that it is interrupted by a sentential-level adverb jil bür ‘every year’ from the verb aw- (‘buy’). In this example, the thematic reading of the embedded object is accessible. Meanwhile, the embedded subject eej (‘mother’) can receive acc case marking. The acc-subject arguably is located at the edge of the embedded CP, since it not only receives ECM but is also locally bound by the matrix subject.

figure am

Given the observations above, the core generalization about embedded thematic topics can therefore be descriptively stated as (39).

figure an

5 A/\(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-operations at the clausal periphery: an Agree-based analysis

5.1 Overview of the Analysis

In this section, I summarize the important properties of Mongolian wh-questions, thematic topicalization, ECM, and the proposed analyses for them. I then introduce a unified Agree-based approach which accounts for their interactions at the embedded clausal periphery. First, it was suggested in Fong (2019) that Mongolian Spec CP can be an A-position. The edge of CP is also a position where the embedded subject may receive acc case from the matrix clause. Embedded subjects which do not move to Spec CP stay in Spec TP and receive nom instead. In addition, A-movement launching from Spec CP into the matrix clause is not deemed a violation of the ban on improper movement, since Spec CP can be an A-position in Mongolian.

figure ao

Independently, I have presented novel data on finite CPs hosting embedded wh-questions and thematic topicalization. I have adopted (17), repeated below as (41), for wh-questions with embedded scope, and (35), repeated as (42), for embedded thematic topics.

figure ap
figure aq

Further, I have presented and examined patterns in which embedded wh-questions and thematic topics interact with ECM in a similar fashion. In particular, ECM is incompatible with a wh-subject taking embedded scope (descriptively summarized in (28), repeated below as (43)). ECM is also incompatible with an embedded thematic subject (summarized in (39), repeated below as (44)).

figure ar
figure as

I suggest that (43-44) are tied to the profile of the Mongolian embedded clausal periphery (41-42). To model this connection, I offer two interrelated claims. The first claim is that raising of embedded subjects to Spec CP occurs in response to a \(\phi \)-probe on \(\hbox {C}^{0}\), along the line of what has been proposed in Fong (2019) (see also van Urk 2015). Since the choice between nom and acc case on the subject of embedded finite clauses is in principle optional in Mongolian, I assume that the \(\phi \)-probe has the option of staying on \(\hbox {C}^{0}\) or being inherited by a lower head (i.e., \(\hbox {T}^{0}\)). When the \(\phi \)-probe remains on \(\hbox {C}^{0}\), it agrees with the embedded subject, and this suppresses the agreement with \(\hbox {T}^{0}\) which would have resulted in nominative case assignment. As a result, the subject raises to Spec CP, where it becomes accessible to accusative case assignment.Footnote 14 By contrast, when the \(\phi \)-probe is inherited by \(\hbox {T}^{0}\), no \(\phi \)-agreement with \(\hbox {C}^{0}\) is triggered, so that the embedded subject is free to agree with \(\hbox {T}^{0}\). As a consequence, the subject does not raise to Spec CP, and the case that emerges on the subject would be nominative.

This leads to the second claim: the incompatibility between ECM and \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-operations (43-44) is due to the fact that agreement with the \(\phi \)-probe on \(\hbox {C}^{0}\) bleeds thematic topic and wh-licensing. Therefore, in the current account, the interactions between A- and \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-operations on an embedded subject are modeled in terms of relations between probes bearing different kinds of features. I lay out the analysis in greater detail in the next section.

5.2 An Agree-Based Approach to Mongolian ECM

The core aspect of the current analysis is couched in the view that movement of an element X requires prior establishment of a relation between X and a c-commanding probe with some featural requirement which X satisfies (Chomsky 2000). Under this view, differences between movement types emerge from properties of distinct features involved in Agree. Chomsky (2007:25), for example, characterizes the difference between A- and \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-movement as follows: “A-movement is IM (internal merge) contingent on probe by uninterpretable inflectional features, while \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-movement is IM driven by EF”. In addition, researchers such as van Urk (2015) propose that A-movement behaves distinctly from \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-movement because the former involves \(\phi \)-feature agreement but the latter is driven by \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-features such as topic or wh. Likewise, in the current analysis for Mongolian, I take (A-)movement to Spec CP to be driven by \(\phi \)-agreement, and in contrast, \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-operations such as wh-licensing and thematic topicalization involve \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-features (e.g., wh, topic).

As both wh-licensing and topicalization interact with subject raising to Spec CP (ECM), I will start with an analysis of the raising/ECM construction in Mongolian. I assume a general Agree-based view of phrasal movement, according to which A-movement is tied to \(\phi \)-agreement (specific formulation varies, see e.g., Chomsky 2007; Obata and Epstein 2011; van Urk 2015). I propose that optional subject raising to Spec CP (which feeds acc case assignment) is due to (45).

figure at

The above part of my proposal is similar to Fong’s in that both posit a \(\phi \)-probe on \(\hbox {C}^{0}\) that drives subject raising to Spec CP. However, I depart from Fong (2019) in one respect. Recall that Fong assumes the acc case on the raised subject is assigned by v in the matrix clause. As it turns out, independent data suggest that the case assignment mechanism in Mongolian needs to be reconsidered. In this paper, I adopt the idea in Gong (2023) that Mongolian possesses two modalities of structural case assignment, in which acc case is assigned as a dependent case, and nom case is assigned by finite T.Footnote 15

figure au

Under the proposal (46), acc is assigned as a dependent case. In ECM constructions, the embedded subject raises to the edge of the embedded CP by (45), where it becomes accessible to dependent case competition by virtue of being locally c-commanded by a matrix DP (e.g., the matrix subject at Spec vP).

It should be noted that for the purpose of this paper, the specific acc case assignment mechanism is not crucial, as long as the subject DP must raise to the edge of the embedded CP to receive accusative case. Both Fong’s head-licensing analysis and the dependent case analysis assumed here require the embedded subject to raise to the phase edge in order to be accessible to acc case assignment. Nevertheless, additional facts suggest that a hybrid analysis in which accusative is a dependent case might be on the right track (see also Aravind 2021 for additional support for a dependent case analysis of Mongolian acc case). Before turning to the discussion of ECM, I will briefly review some of the empirical evidence from Gong (2023) for the hybrid case assignment mechanism in Mongolian.

The core evidence given in Gong (2023) for the dependent case status of Mongolian accusative case involves various types of embedded constructions. First, embedded subjects can be marked with accusative even when there is no functional head in the matrix clause that can assign to it accusative case. Aravind (2021) observes that predicates like uurlakh ‘to become angry’ do not license accusative case, as shown in (47a). However, when the predicate uurlakh takes a finite CP as its complement, as in (47b), the embedded subject can be in accusative case.

figure av

It seems that the availability of accusative case marking on the embedded subject is dissociated from the case-assigning potential of the matrix predicate. This is further supported by (48), in which the subject of an adjunct clause can be marked with accusative case.

figure aw

In this sentence, there is no functional head available in the matrix clause that can license the acc case on the subject of the adjunct clause. However, the construction is grammatical with an acc subject. On the dependent case approach, these facts follow straightforwardly. The embedded subject, being at the edge of the embedded clause, receives accusative case via competition with the matrix subject in (47b) and (48).

Second, the availability of acc on the embedded subject is contingent upon the presence of an eligible matrix clause case competitor. If a matrix case competitor is absent (e.g., in a construction where the matrix clause involves an impersonal predicate), acc case is not allowed on the embedded subject. An example is provided in (49).

figure ax

The fact that an acc subject is disallowed in (49) would follow if acc case on the embedded subject is a dependent case, assigned via competition with an eligible c-commanding case competitor in the same domain. Since there is no such case competitor in (49), acc case is unavailable.

In addition to the dependent accusative case, nominative case is unavailable in various tenseless domains. One of the relevant constructions discussed in Gong (2023) concerns nonfinite relative clauses in Mongolian, which lacks a tense head. In these relative clauses, the subject must be in genitive case, as shown in (50). This is unexpected if nominative is an unmarked case that is freely available independent of the specific clausal structure, but it is consistent with the view that nominative is assigned by \(\hbox {T}^{0}\).

figure ay

To summarize, we observe that the Mongolian accusative case behaves like a dependent case, whereas the nominative case behaves like it is licensed by \(\hbox {T}^{0}\). In the subsequent discussion, I will adopt the hybrid case assignment mechanism (46) and demonstrate how it works in tandem with the \(\phi \)-probe on \(\hbox {C}^{0}\) in ECM constructions.

In order to illustrate the main proposal, consider the ECM example in (51), in which the subject of the embedded finite CP can either be in nom or acc case.

figure az

According to (45), movement to the edge of CP is driven by agreement with a \(\phi \)-probe on \(\hbox {C}^{0}\). This proceeds as follows. As visualized in (52), the \(\phi \)-probe on \(\hbox {C}^{0}\) searches its domain and agrees with the subject at Spec vP, since it is the closest goal bearing appropriate features. As a result of this agreement process, the subject DP undergoes movement to Spec CP. At the edge of CP, the subject DP receives dependent accusative case by competition with an eligible case competitor in the higher domain (e.g., the matrix subject, marked in (52) as \(\hbox {DP}_{2}\)).

figure ba

In (52), movement to Spec CP takes place in response to the \(\phi \)-probe on \(\hbox {C}^{0}\). Thus, this movement has A-properties and Spec CP is considered an A-position. This is essentially what has been proposed in Fong (2019) for Mongolian, building on Chomsky (2007) and van Urk (2015). However, it is worth noting that in the current formulation, \(\phi \)-agreement with \(\hbox {C}^{0}\) only triggers movement to Spec CP, but \(\hbox {C}^{0}\) is not the accusative case assigner (nor the matrix v). In my account, \(\phi \)-feature-driven movement to Spec CP feeds acc case assignment in the sense that it puts the subject DP in a structural position accessible to dependent case competition. Acc case assignment is dissociated from \(\phi \)-agreement in Mongolian ECM.

Recall that the major feature of Mongolian ECM is that it is optional. The embedded subject can be in either accusative or nominative case. So what about nominative subjects? How are they licensed? I suggest, along the lines of Miyagawa (2005b), that the \(\phi \)-probe has the option of staying on \(\hbox {C}^{0}\) or being inherited by \(\hbox {T}^{0}\) (cf. a later account by Miyagawa 2009, which draws on \(\alpha \)P and assumes the \(\phi \)-probe always gets inherited by a lower head). When the \(\phi \)-probe is inherited by \(\hbox {T}^{0}\), no \(\phi \)-agreement with \(\hbox {C}^{0}\) is triggered, so that the embedded subject is free to agree with \(\hbox {T}^{0}\). Since \(\hbox {T}^{0}\) is the functional head that assigns nom case, nom case will be licensed on the subject DP in this scenario. As indicated in (53), I further suggest that once the subject DP agrees with \(\hbox {T}^{0}\), it moves to Spec TP. In the scenario depicted in (53), Spec CP is not an A-position, and ECM is not possible. Notably, my approach renders ECM an obligatory operation despite the fact that it yields an optional variation. The \(\hbox {C}^{0}\) head may optionally keep its \(\phi \)-feature specification or pass it down to \(\hbox {T}^{0}\), but once it keeps its \(\phi \)-probe, agreement with \(\hbox {C}^{0}\) and movement to Spec CP become obligatory in syntax.

figure bb

Before turning to the core data involving topicalization and wh-licensing, I will summarize the key components of my analysis of ECM. While maintaining Fong’s suggestion that movement to Spec CP in Mongolian takes place in response to a \(\phi \)-probe on \(\hbox {C}^{0}\), I have proposed that \(\phi \)-agreement with \(\hbox {C}^{0}\) (resulting in acc case assignment) suppresses agreement with \(\hbox {T}^{0}\) (resulting in nom case assignment). If \(\hbox {C}^{0}\) passes down its \(\phi \)-features to \(\hbox {T}^{0}\), the subject DP does not agree with \(\hbox {C}^{0}\) and is free to agree with \(\hbox {T}^{0}\), resulting in nominative case assignment. Taken together, the current proposal amounts to stating that the optionality of ECM in Mongolian is due to the fact that \(\hbox {C}^{0}\) has the option to either keep its \(\phi \)-probe or pass it down to \(\hbox {T}^{0}\) (similar to the mechanism adopted in Miyagawa 2005b, cf. Miyagawa 2009; also cf. Takeuchi 2010 for a similar proposal based on optional feature inheritance for Japanese ECM).Footnote 16

The current Agree-based proposal allows for a systematic treatment of the core data concerning interactions between A/\(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-operations at the clausal periphery. The previous claim in Fong (2019) that there is a \(\phi \)-probe on \(\hbox {C}^{0}\) remained largely a speculation for Mongolian, given the predictions of van Urk ’s system. If \(\hbox {C}^{0}\) indeed can be introduced with a \(\phi \)-probe, then it is expected to have detectable properties or have at least some effects on the derivation that can be indirectly observed. The current Mongolian data on A/\(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-interactions offer a case study in which the potential \(\phi \)-probe on \(\hbox {C}^{0}\) affects the derivation in a way that is observable on the surface. Specifically, I argue that the incompatibility between ECM and \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-operations as summarized in (43-44) is due to the fact that agreement with the \(\phi \)-probe on \(\hbox {C}^{0}\) bleeds thematic topic and wh-licensing in the lower domain. I spell out the specific analyses for topicalization vs. ECM interaction and wh-licensing vs. ECM interaction in the next two sections.

5.3 Topicalization vs. ECM

I start with the analysis of interactions between thematic topicalization and ECM. First, I take typical \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-features (thematic topic, wh) to always come from the clausal periphery (i.e., \(\hbox {C}^{0}\)). This is consistent with the fact that thematic topic is possible inside an embedded clause only if it projects a CP headed by [\(_{\textsc {c}}\) gej]. Embedded clauses smaller than a CP, like the one in (54) (= (34)), are unable to host a thematic topic.

figure bc

Given the structure for embedded thematic topics (55), I suggest that the Theme head \(\hbox {Th}^{0}\) requires selection by \(\hbox {C}^{0}\) to obtain its feature specifications.

figure bd

Specifically, I propose that an \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-probe with the Theme feature specification is introduced on \(\hbox {C}^{0}\), and is inherited by \(\hbox {Th}^{0}\). To some extent, the current proposal resembles Miyagawa ’s (2010) topic feature inheritance from \(\hbox {C}^{0}\) to \(\hbox {T}^{0}\)/\(\alpha ^0\). I also adopt his treatment of the topic head in which the topic probe (i.e., Theme in the current discussion) is not associated with any particular phrase in the structure. Following Miyagawa, I suggest that the Theme feature, once inherited by \(\hbox {Th}^{0}\), simply requires its specifier to be filled. In other words, Theme does not probe, and a DP may receive thematic topic interpretation only if it occupies Spec ThP.Footnote 17 With this in mind, consider the case in which the thematic topic interpretation of a bol-marked embedded subject is incompatible with ECM. The relevant example is repeated as follows.

figure be

I suggest that the thematic interpretation is unavailable in (56) because agreement with the \(\phi \)-probe on \(\hbox {C}^{0}\) induces movement to Spec CP, bleeding thematic topic licensing on the subject. This process is illustrated with (57). First, \(\hbox {C}^{0}\) is introduced with \(\phi \) and Theme. Upon the merger of \(\hbox {C}^{0}\), two processes happen simultaneously: \(\hbox {Th}^{0}\) receives Theme by inheritance, and in turn requires its specifier to be filled.Footnote 18 At the same time, the \(\phi \)-probe on \(\hbox {C}^{0}\) immediately searches its domain and \(\phi \)-agrees with the subject [\(_{\textsc {dp}}\) Natsagdorj ], triggering subject raising to Spec CP. As \(\phi \)-agreement takes place upon the merger of \(\hbox {C}^{0}\), the raised subject will always end up skipping Spec ThP. Therefore, when \(\hbox {C}^{0}\) retains its \(\phi \)-features, ECM becomes obligatory, and thus the subject can never become a thematic topic.

figure bf

Unlike the subject which must raise in response to \(\phi \) on \(\hbox {C}^{0}\), the object still remains within the domain of \(\hbox {Th}^{0}\), and should in principle be able to raise to Spec ThP. This prediction is borne out by (38), repeated below as (58). In this sentence, the object nooluurin büteegdekhüüniig bol (‘cashmere product’.acc top) receives a thematic topic reading within the embedded clause. Recall from the previous discussion that the object is no longer in its canonical verb-adjacent position, because it is separated from the verb ‘buy’ by a sentential-level adverb ‘every year’.

figure bg

As illustrated in (59-60), I suggest that the embedded object in (58) ultimately receives thematic interpretation at Spec ThP. The derivation proceeds as follows. First, upon the merger of \(\hbox {C}^{0}\), \(\hbox {Th}^{0}\) inherits the Theme feature from \(\hbox {C}^{0}\), and the \(\phi \)-probe on \(\hbox {C}^{0}\) agrees with the embedded subject. This is shown in (59). As shown in (60), although the subject must raise to Spec CP in response to the \(\phi \)-probe and skipping ThP, the object can independently raise to Spec ThP, receiving a thematic topic reading. Therefore, a construction with a subject undergoing ECM and an object acting as thematic topic like (58) is possible.

figure bh

To sum up, under the current analysis, the interactions between ECM and thematic topicalization essentially result from the \(\phi \)-probe on \(\hbox {C}^{0}\) acting on a structure in which a ThP is projected lower than CP. If \(\hbox {C}^{0}\) retains its \(\phi \)-features, the \(\phi \)-probe always ends up agreeing with the subject and inducing raising to Spec CP. Since a DP may receive thematic topic interpretation only when it occupies Spec ThP, a subject attracted by \(\phi \) on \(\hbox {C}^{0}\) always ends up skipping the thematic topic position Spec ThP (importantly, as noted in footnote 18, Spec ThP cannot be filled “in passing”). The object, in contrast, does not interact with the \(\phi \) probe on \(\hbox {C}^{0}\). Therefore, it is free to move to Spec ThP, receiving thematic interpretation there. On the other hand, if \(\hbox {C}^{0}\) passes its \(\phi \)-features down to \(\hbox {T}^{0}\), the subject DP does not agree with \(\hbox {C}^{0}\) or move to Spec CP. In this scenario, either the subject or the object can become the embedded thematic topic. Since no raising-to-Spec CP occurs in the absence of \(\phi \)-features on \(\hbox {C}^{0}\), when a subject becomes the thematic topic, it can only be in nom case:

figure bi

5.4 Wh-Licensing vs. ECM

In this section, I extend the above analysis to the interactions between embedded wh-questions and ECM. Based on the observations made in Section 3, I suggest that Mongolian wh-questions do not involve covert movement of wh-phrases, and instead involve the establishment of an Agree relation between an \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-probe with wh-feature specifications and a goal bearing wh-features.Footnote 19 As an \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-feature, the interrogative wh feature is similar to Theme in the sense that it is introduced on \(\hbox {C}^{0}\) and is inherited by a lower head (i.e., \(\hbox {Force}^{0}\)). In effect, \(\hbox {Force}^{0}\) starts functioning as a probe which seeks an appropriate goal to agree with only until \(\hbox {C}^{0}\) is introduced.

Before turning to the core data, another key property of wh-questions in Mongolian needs to be discussed. Let us first consider the examples in (62).

figure bj

These examples instantiate a well-known generalization first proposed by Harada (1972), that in languages like Japanese (as well as Mongolian), a wh-phrase must be contained within the domain of the +Q clause where it takes scope.Footnote 20 One piece of evidence suggesting Harada’s condition in fact applies at LF concerns the radical reconstruction property of long distance scrambling (LDS),Footnote 21 illustrated with (63) from Mongolian.

figure bk

Example (63) is a declarative sentence with an embedded wh-question. The wh-phrase yamar nomig (‘what book’.acc) originates in the embedded clause and undergoes long distance scrambling to the matrix-initial position. Since the matrix clause is a declarative sentence, the wh-phrase cannot be licensed in the scrambled position. As a result, the wh-phrase must be interpreted within the embedded clause even though at S-structure it occupies a position outside of it. The grammaticality of (63) suggests that Harada’s condition applies at LF – the scrambled wh-phrase undergoes reconstruction to a position within the scope of the embedded +Q clause, satisfying Harada’s condition at LF.

In addition to the above observations, we have also seen that in Mongolian Q-particles unambiguously indicate the wh-scope. Taken together, Harada’s condition can be stated as follows for Mongolian:

figure bl

Footnote 22

With this background in mind, consider (65), repeated from (25). In an embedded question, a wh-phrase acting as a subject argument may not undergo ECM.Footnote 23

figure bn

In addition, recall that when the wh-phrase is not the subject in a Mongolian embedded question, ECM is indeed allowed (66).

figure bo

The question relevant for the Mongolian case here is why ECM becomes unavailable when the subject is a wh-phrase taking embedded scope. Recall that ECM is due to the raising of an embedded subject to Spec CP, which is taken to be a kind of A-movement. In the current Agree-based analysis, properties of A-movement are the properties of movement resulting from the interaction of a \(\phi \)-probe with its goal, rather than the properties of a particular syntactic position. Under this view, we may attribute the contrast between (65) and (66) to an interpretation requirement stated as follows.

figure bp

FDIR states that an element undergoing movement in response to a \(\phi \)-probe is interpreted within the Spell-Out domain of its landing site. I will first demonstrate how it accounts for the facts regarding the interaction between ECM and wh-interpretation. Then, I will provide independent evidence for FDIR and discuss its potential implications.

First, FDIR correctly excludes ECM on a wh-subject taking embedded scope as in (65). Suppose \(\hbox {C}^{0}\) is introduced with a \(\phi \)-probe along with wh, the latter being inherited by \(\hbox {Force}^{0}\), as depicted in (68).

figure bq

In (68), the subject DP bears a wh-feature. Thus, both the \(\phi \)-probe on \(\hbox {C}^{0}\) and the wh-probe inherited by \(\hbox {Force}^{0}\) will target the subject upon merger of \(\hbox {C}^{0}\). The \(\phi \)-probe agrees with the subject and drives movement of the subject to Spec CP. Note that \(\hbox {C}^{0}\) is a phase head. By standard assumptions of phase theory (Chomsky 2001), the Spell-Out domain of the C phase is the complement of \(\hbox {C}^{0}\) – ForceP. This is labeled as Domain 1 in (68). The edge of CP, on the other hand, does not belong to the Spell-Out domain of the C phase, but instead belongs to the next higher domain, labeled as Domain 2 in (68). By FDIR, the subject wh-phrase must be interpreted in Domain 2 (i.e., the domain which Spec CP belongs to). However, licensing of the embedded question simultaneously requires the wh-phrase to be interpreted within the scope of the Q-particle (by Harada’s Condition), which is an area within Domain 1. As a result of FDIR, the subject wh-phrase is forced to be interpreted outside of the scope of the Q-particle, and thus will not be licensed.

On the other hand, if \(\hbox {C}^{0}\) passes down its \(\phi \)-features to \(\hbox {T}^{0}\), the subject does not undergo \(\phi \)-feature driven movement to Spec CP, and is free to agree with \(\hbox {T}^{0}\), receiving nom case. The subject then raises to Spec TP. By FDIR, the subject needs to be interpreted in Domain 1 and thus has the option to be interpreted within the scope of the Q-particle, satisfying Harada’s Condition.Footnote 24 Therefore, nominative wh-subjects can be properly licensed.

figure br

In contrast, when it is the object that bears wh-features instead of the subject, wh-licensing does not interfere with \(\phi \)-probing of \(\hbox {C}^{0}\), and thus ECM should be possible. This is borne out by the data in (66), with the derivation schematized in (70). Here \(\hbox {Force}^{0}\) targets the object DP bearing wh, and \(\phi \) on \(\hbox {C}^{0}\) targets the subject which does not bear wh. The subject can raise to Spec CP, and the object wh-phrase is interpreted within the lower domain and is properly licensed by \(\hbox {Force}^{0}\).

figure bs

Finally, consider the behavior of an ECM subject in a matrix scope question in (27), repeated in (71).

figure bt

Here the wh-phrase khen (‘who’) in the embedded subject position can undergo ECM. Under the current analysis, this means that the subject can raise to the embedded Spec CP in response to the \(\phi \)-probe on \(\hbox {C}^{0}\). Since the embedded clause is not a question, there is no ForceP projected inside the embedded domain. Since wh-licensing is taken to be able to apply long-distance in the current analysis, the \(\hbox {Force}^{0}\) in the matrix clause can license the embedded subject either in embedded Spec CP (with ECM) or in a CP-internal position (without ECM). The visualization in (72) illustrates the former scenario, in which the subject DP is at the edge of CP, getting acc case there and being wh-licensed by matrix \(\hbox {Force}^{0}\).Footnote 25

figure bu

Notice that in the case of (71), the domain which the embedded wh-subject is interpreted in does not matter for the purpose of wh-licensing. This is because Harada’s Condition only requires the subject to be interpreted within the scope of the matrix \(\hbox {Force}^{0}\) at LF. Since \(\hbox {Force}^{0}\) is located at the periphery of the matrix clause in (71), as long as the wh-subject can be interpreted somewhere within the scope of the matrix \(\hbox {Force}^{0}\), Harada’s Condition can be satisfied. Therefore, regardless of whether ECM applies or not, the embedded subject can be interpreted within the scope of the matrix \(\hbox {Force}^{0}\) in (71).

The Mongolian ECM facts discussed so far are derived by FDIR which restricts the domain of interpretation. Because Mongolian has a clausal periphery in which ForceP is projected below CP, ECM which requires \(\phi \)-feature-driven movement beyond the domain of the C phase, will not be able to reconstruct back into the original domain for wh-question interpretation.

Notice that FDIR only restricts the domain in which a DP must be interpreted, and is underspecified with respect to the specific syntactic position in which the DP is interpreted within that domain. This domain-centered nature of FDIR predicts that while \(\phi \)-feature-driven movement crossing the domain of a phase head must be interpreted in the higher domain, \(\phi \)-feature-driven movement which starts out and lands within the same domain of a phase head should in principle be able to reconstruct. This prediction is borne out by independent data from Mongolian. Consider the following sentence.

figure bv

In (73), the embedded subject neg tsagdaa ‘a policeman’ can take either wide scope or narrow scope with respect to the modal baikh ‘might’. The sentence can either be interpreted as I think there is a policeman and he might come soon, or I think it might be that a policeman will come soon. I suggest that both scopes are available in this case because the subject ‘a policeman’ undergoes movement to the embedded Spec TP, after agreeing with the \(\phi \)-probe on the embedded \(\hbox {T}^{0}\).

figure bw

Since this movement is \(\phi \)-feature-driven, the DP a policeman needs to be interpreted within the Spell-Out domain of Spec TP (its landing site). Crucially, the Spell-Out domain containing the landing site Spec TP also contains the launching site Spec vP (i.e., the two syntactic positions fall within the same Spell-Out domain). Thus, the DP ‘a policeman’ can either be interpreted at Spec TP (outside of the scope of ‘might’), or at Spec vP (within the scope of ‘might’).Footnote 26

In contrast to (73), an ECM subject can only take wide scope with respect to baikh ‘might’. Sentence (75) can only be interpreted as I think there is a policeman and he might come soon.

figure bx

The contrast between (75) and (73) is predicted by FDIR. In (75), the ECM subject has undergone \(\phi \)-feature-driven movement to the embedded Spec CP, which is in a separate Spell-Out domain from its launching site. This is illustrated in (76).

figure by

The scenario depicted in (76) is familiar by now: According to FDIR, the DP ‘a policeman’ needs to be interpreted within the Spell-Out domain which Spec CP belongs to. Consequently, ‘a policeman’ must be interpreted outside of the scope of ‘might’.

In addition, FDIR only restricts the interpretation of a DP that has undergone \(\phi \)-feature-driven movement. We expect that movement which is not driven by \(\phi \)-features should not be constrained by FDIR. This prediction is indeed borne out. The fact that feature-driven movement to Spec CP does not reconstruct for wh-interpretation contrasts sharply with LDS, which does reconstruct for wh-interpretation. Recall that LDS of a wh-phrase shows radical reconstruction in the sense that it has no effect on the interpretation of the embedded question. The relevant example is (63), repeated here as (77).

figure bz

Example (77) shows a wh-phrase can be scrambled to a position outside the c-command domain of the Q-particle where it takes scope at LF. The wh-phrase ‘what book’ is scrambled from the embedded question into the matrix declarative clause, and the sentence is still acceptable. What is surprising about (77) is that although at S-structure the wh-phrase is outside of the scope of the embedded wh Q-particle, just like in (62b), the result sounds far better than (62b). Therefore, a long distance scrambled wh-phrase can nevertheless be interpreted within the scope of an embedded Q-particle. Since LDS is not driven by agreement with a \(\phi \)-probe, FDIR does not apply here, and the fact that it shows radical reconstruction is expected.Footnote 27

6 Mongolian Clausal Periphery: Implications for the A/\(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-Distinction

Before I conclude this paper, I will provide some further remarks on the implications of the current account. The major part of my empirical investigation has focused on the consequences of previous proposals that some languages allow A-movement through CP, and therefore Spec CP can be an A-position in those languages (e.g., Tanaka 2002a for Japanese, Yoon 1991 for Korean, and Fong 2019 for Mongolian). The A-status of Spec CP raises important questions about the nature of the A/\(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-distinction and how scope, usually determined by \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-operations, can be affected by the A/\(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-status of CP. Specifically, if movement to Spec CP is a kind of A-movement, such movement should in principle interact with other operations that target CP, such as wh-licensing and topicalization.

The empirical data presented in this paper indicate that in Mongolian, subject raising to Spec CP (feeding ECM) is incompatible with the subject having embedded wh-scope or embedded thematic topic interpretation. Since it is proposed that the \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-domain associated with wh- and topic interpretation is structurally lower than Spec CP, which is an A-domain in ECM contexts, superficially it seems that the relevant A/\(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-interaction can also be formulated as the result of some variant of the ban on improper movement. Specifically, one might posit that the embedded subject has already undergone some sort of \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-operation (related to wh or topic interpretation) associated with the lower \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-domain, and hence it may not further undergo A-movement (ECM) to Spec CP, an A-position.

However, a careful examination reveals a crucial distinction between Mongolian and typical English improper movement cases. As explicated in Keine (2016, 2018), the canonical English improper movement scenario involves the interaction of two constraints: i) locality requirement of movement; and ii) a well-formedness requirement on movement chains such as a ban on improper movement (specific formulation and explanations vary, see e.g., Chomsky 1973; May 1979; Lasnik and Uriagereka 1988; Cinque 1990; Müller and Sternefeld 1993, among others). But I argue that the same scenario in fact does not arise in Mongolian. Consider the account by May (1979), building on Chomsky (1973), for the ungrammaticality of English examples like (78).

figure ca

Example (78) is ungrammatical because a ban on improper movementFootnote 28 rules out the chain representation created by locality-governed movement, shown in (79), adapted from May’s (2). Due to locality constraints, extraction out of a finite clause requires a prior stop at the edge of the clause. In (79), this obligatory intermediate stop is \(\hbox {e}_2\) at \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {S}\hspace{-1.27777pt}}\hspace{1.27777pt}\), an \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-position in English. Essentially, the surface form of (79) is ruled out in English under the dual restrictions of locality plus a ban on improper movement — If locality is obeyed, then movement targeting the matrix S must stop at the intermediate \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-position at \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {S}\hspace{-1.27777pt}}\hspace{1.27777pt}\), violating the ban on improper movement. On the other hand, if the DP had skipped the intermediate \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {S}\hspace{-1.27777pt}}\hspace{1.27777pt}\) and directly moved to the matrix clause A-landing site, the ban on improper movement would have been circumvented, but locality would have been violated. There is no derivation available that satisfies both in (78)/(79). Following the discussion in Keine (2016, 2018) and restating this in terms of phase theory, what (78)/(79) suggests is that a DP at the edge of a phase cannot undergo further A-movement, and can only undergo further \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-movement. Because locality alone does not rule out A-movement from a phase edge, an additional ban on improper movement has been invoked.

figure cb

While maintaining that locality and the well-formedness requirement of chains (*\(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\) \(\rightarrow \)A) hold in English and Mongolian alike, I would like to argue that these restrictions do not end up achieving the same effect in Mongolian, due to differences in clausal architecture. In particular, suppose that in Mongolian a DP moving out of CP must stop at the edge of CP; from the perspective of locality, the DP need not stop at ForceP or Th(eme)P, which are lower than CP. Unlike the English configuration in (79), in which the intermediate \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-landing site cannot be skipped, in Mongolian, subject raising to CP, skipping the lower wh-positions (ForceP) or thematic topic positions (ThemeP), does not violate any locality restriction. Stated in more general terms, the scenario in Mongolian is that since its clausal periphery has the shape \([[[ \hspace{.1in} ... \hspace{.1in} _{\textsc {a}}] \,{}_{\hspace{1.27777pt}\overline{\hspace{-1.27777pt}{\textbf {A}}\hspace{-1.27777pt}}\hspace{1.27777pt}}] \,{}_{{{\textbf {A}}}}]\), movement can in principle directly target the A-edge of the clause, skipping the lower \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-domain.

figure cc

If this is on the right track, the resemblance of the Mongolian phenomena and the canonical English improper movement cases is only superficial; and whatever constraints in the syntax rule out English (78) cannot be used to rule out the Mongolian cases without further stipulations. Since locality does not force the Mongolian A-movement under examination to have an intermediate \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-stop, invoking a ban on improper movement becomes difficult. In addition, under the current account, Mongolian wh-licensing does not involve covert or overt movement. The proposal that the wh-phrase does not move adds further technical challenge to approaches that rely on a ban on improper movement. In the current account, the Mongolian derivation depicted in (80) results directly from the Agree mechanism. The embedded subject moves to Spec CP in response to the \(\phi \)-probe on \(\hbox {C}^{0}\). Since \(\phi \)-agreement is involved, subject raising to Spec CP behaves like A-movement. In addition, since the intermediate \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-domain is skipped, the subject never gets properly licensed by \(\hbox {Force}^{0}\) or thematic topic head. Finally, this account also shares with previous Agree-based approaches in that no inherent A/\(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-distinction is invoked.

7 Conclusion

To sum up the discussion in this paper, I have proposed that interactions between A/\(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-operations at the Mongolian embedded clausal periphery are due to different features involved in Agree relations, combined with the clausal architecture. My proposal is motivated on the basis of two scenarios in Mongolian embedded constructions with ECM. The first scenario is that ECM is degraded when the embedded subject is a wh-phrase taking embedded scope. The second scenario is that ECM is also degraded when the embedded subject is an embedded thematic topic. Since both wh-licensing and topicalization are \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-operations, their interactions with ECM suggest a unified account. Based on the distribution of complementizers in embedded wh-questions and embedded topicalization, I concluded that there exists an intermediate \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-domain between TP and CP in Mongolian, resulting in a \([[[ \hspace{.1in} ... \hspace{.1in} _{\textsc {a}}] \,{}_{\hspace{1.27777pt}\overline{\hspace{-1.27777pt}{\textbf {A}}\hspace{-1.27777pt}}\hspace{1.27777pt}}] \,{}_{{{\textbf {A}}}}]\) clausal periphery. I proposed that Mongolian subject raising to Spec CP takes place in response to a \(\phi \)-probe on \(\hbox {C}^{0}\), feeding accusative case assignment on the subject. If raising happens, the subject essentially skips over the intermediate \(\hspace{1.27777pt}\overline{\hspace{-1.27777pt}\text {A}\hspace{-1.27777pt}}\hspace{1.27777pt}\)-domain, which fails to satisfy conditions for wh- or thematic topic licensing.