Between Syntax and Pragmatics: The Causal Conjunction Protože in Spoken and Written Czech

Čermáková, Anna; Komrsková, Zuzana; Kopřivová, Marie; Poukarová, Petra

doi:10.1007/s41701-017-0014-y

Between Syntax and Pragmatics: The Causal Conjunction Protože in Spoken and Written Czech

Original Paper
Published: 25 April 2017

Volume 1, pages 393–414, (2017)
Cite this article

Corpus Pragmatics Aims and scope Submit manuscript

Anna Čermáková ORCID: orcid.org/0000-0001-8597-520X¹,
Zuzana Komrsková¹,
Marie Kopřivová¹ &
…
Petra Poukarová¹

691 Accesses
1 Citation
1 Altmetric
Explore all metrics

Abstract

Research into causal conjunctions suggests that there are various degrees of causality and that causality is better situated on a cline between strong and weak. Some studies of English because/’cause/cos suggest a diachronic change in the spoken language, where the use of because is shifting from prototypical subordinator to discourse marker (Stenström, in: Jucker, Ziv (eds) Discourse markers, John Benjamins, Amsterdam, 1998; Burridge in Aust J Linguist 34(4):524–548, 2014). This study examines in detail the use of the most frequent Czech causal conjunction protože in both written and spoken language, thus making a further contribution to cross-linguistic research into causality and to research into the differences between spoken and written language more generally. There are two major language varieties of Czech: the common vernacular and the standard literary language (the codified norm). These two varieties differ in a number of respects—at the morphological, lexical and phonological levels. In comparing spoken and written Czech, very few studies include syntactic features and none are based on large-scale authentic spoken data. Based on the corpus data, the conjunction protože occurs strikingly more frequently in spoken Czech than in written language. This study looks at some differences in its distribution. The study is based on extensive corpus data of both written Czech (comprising fiction, newspapers and academic texts) and spoken Czech (corpora of spontaneous conversations and TV debates).

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Aspects of Anaphora in Chinese and in Some Germanic, Romance, and Slavic Languages, the ‘Syntactic’ Versus ‘Pragmatic’ Language Typology, and Neo-Gricean Pragmatics

Conceptual and Procedural Aspects of Causal Meaning: Corpus-Analytic Evidence from Modern Greek

Article 28 April 2017

Valandis Bardzokas

Discourse Connectives: Theoretical Models and Empirical Validations in Humans and Computers

Notes

We assume that there are features that are e.g. typical for spoken English (as opposed to written English) and we may not be able to establish cross-linguistic parallels, while there are features typical for spoken English, where we may be able to claim a degree of cross-linguistic universality.
Page numbers given for Sanders (2005) refer to the online copy of the article available at: http://www.let.uu.nl/~Ted.Sanders/personal/uploads/pdf/Sanders%20(2005).pdf.
Cf. also Czech “příčina” [cause] and “důvod” [reason] (Svoboda 1954; Grepl and Karlík 1998).
Czech belongs to the Slavic language family and is a highly inflected language with a complex morphological system. Historical events connected to the Czech struggle for independence have shaped the prevalent attitudes to language correctness and have given space to puristic tendencies. Czech is stratified geographically, stylistically and socially, with perhaps the most notable difference drawn between StC and CC (as well as between StC, which is spoken in Bohemia, and the distinct Moravian dialects). However, in actual language use there are no clear-cut boundaries between the two varieties and speakers oscillate between them depending on the situation, and especially with regard to the degree of formality of the given situation.
The H variety corresponds to StC and the L variety to CC.
Recently, for example, Massot (2008) and Rowlett (2011) have argued that French is a diglossic situation. Rowlett (2011: 13) notes, it has been commonplace with French to “claim there had been no significant syntactic change since the end of the 17th century”. With the development of spoken corpora interest in syntactic variation and specifically variation in spoken French has risen (Rowlett 2011: 3) and a potential alternative account for synchronic syntactic variation in French is offered by e.g. Massot (2008). Massot suggests the diglossic approach primarily in response to the extensive research into spoken French (e.g. Blanche-Benveniste 1997, 2003, 2004) and he considers the distinctions between “spoken” and “written” insufficient to explain syntactic variation. He explains the variation in terms of two distinct internal grammars of French (that of the vernacular and that of the more prestigious variety acquired later — note here the parallel with Czech) that speakers have access to, while maintaining that this distinction is not the same as the distinction between spoken and written language (Rowlett 2011). However, all languages with evolved literary tradition to some degree could be considered from this point of view. Learning to write a language always requires adjustments of the spoken variety.
Cf. also Linear Unit Grammar: Integrating Speech and Writing (Sinclair and Mauranen 2006).
Examples with letters are examples from literature review and examples with consecutive numbering are authentic examples from our data (in brackets is the corpus/subcorpus from which the example is taken).
And these certainly have cross-linguistic validity. See also Sinclair and Mauranen (2006).
For French, see e.g. Debaisieux (2013).
For an interesting account of the use of because in the web communications (e.g. Twitter) see Bohman (2016).
Schiffrin analyzes the following as DMs: and, because, but, I mean, now, oh, or, so, then, well, y‘know.
Fraser (2006) uses the “cover term” pragmatic markers (expressions that “occur as part of a discourse segment but are not part of the propositional content of the message”) and distinguishes between four types of pragmatic markers. DMs (that “signal a relation between the discourse segment which hosts them, and the prior discourse segment”) are one of them. Fraser claims that there are over 100 DMs in English, which can be divided into four basic semantic relationships (contrastive, elaborative, inferential, and temporal markers). Aijmer seems to prefer the term pragmatic marker as well (Aijmer 2013). For further discussion of terminology and theoretical approaches to DMs, see e.g. Fischer (2005), for contrastive studies of pragmatic markers see Aijmer and Simon-Vandenbergen (2006).
Crystal (1988: 48) talks about these pragmatic expressions as “the oil which helps us perform the complex task of spontaneous speech”.
But, as further research shows, this is not the case with all discourse markers (e.g. Brinton 1996).
Fraser (2006) mentions that “[o]ne area which requires more investigation is the extent to which there is polysemy throughout the class of DMs”.
For diachronic distribution of the two main Czech causal conjunctions protože and neboť, see http://syd.korpus.cz/ipP7hWCE.dia.
For an overview of the frequency distribution between protože and neboť in spoken and written language, see http://syd.korpus.cz/ipP7hWCE.syn.
The corpora were chosen in analogy to LGSWE (Biber et al. 1999) but in the case of the spoken language we opted for two varieties.
Data are freely available at www.korpus.cz.
More detailed information is available at http://wiki.korpus.cz/doku.php/en:cnk:oral2013.
http://ujc.dialogy.cz/?q=en/node/83.
The lemmatised spoken corpus will be made available later in 2017.
Capitalisation indicate lemma, i.e. overall frequency of all forms including reduced variants.
Schiffrin (1987), however, maintains that one of the important features of a DM is its simultaneous anaphoric and cataphoric function.
For the spoken data we used a pilot version of automatic annotation, which does not reach the accuracy of the annotation of the written data.
Using X ², at p < .001.
Small town in Moravia.
Šopáč is probably a place name.
Ne means ‘no’ but can be used colloquially without having the negative meaning, see example 12.
One of the anonymous reviewers pointed to the fact that it is probably quite common at this point, especially if our online processing has not been completed in terms of syntax. While we are saying the first clause, we may be weighing up the listener’s reaction to what we are saying and reconfiguring it.

References

Adolphs, S., & Carter, R. (2013). Spoken corpus linguistics: From monomodal to multimodal. London: Routledge.
Google Scholar
Aijmer, K. (2002). English discourse particles: Evidence from a corpus. Amsterdam: John Benjamins.
Book Google Scholar
Aijmer, K. (2013). Understanding pragmatic markers. A variational pragmatic approach. Edinburgh: Edinburgh University Press.
Google Scholar
Aijmer, K., & Simon-Vandenbergen, A.-M. (2006). Pragmatic markers in contrast. London: Elsevier.
Google Scholar
Altenberg, B. (1984). Causal linking in spoken and written English. Studia Linguistica, 38, 20–69.
Article Google Scholar
Bermel, N. (2000). Register variation and language standards in Czech. LINCOM Europa, Studies in Slavic Linguistics 13. Muenchen.
Bermel, N. (2014). Czech diglossia: Dismantling or dissolution? In J. Árokay, J. Gvozdanović, & D. Miyajima (Eds.), Divided languages? (pp. 21–37). Berlin: Springer.
Chapter Google Scholar
Biber, D. (1991). Variation across speech and writing. Cambridge: Cambridge University Press.
Google Scholar
Biber, D., Johansson, S., Leech, G., Conrad, S., & Finegan, E. (1999). Longman grammar of spoken and written english. Pearson Education Ltd: Harlow.
Google Scholar
Blanche-Benveniste, C. (1997). Approhes de la langue parlée en français. L’Essentiel. Paris: Ophrys.
Google Scholar
Blanche-Benveniste, C. (2003). La langue parlée. In M. Yaguello (Ed.), Le grand livre de la langue française (pp. 317–344). Paris: Seuil.
Google Scholar
Blanche-Benveniste, C. (2004). Le singulier et le pluriel en français parlé contemporain. Bulletin de la Société de linguistique de Paris, tome XCIX, pp. 129–154.
Bohman, A. (2016). Language change because Twitter? Factors motivating innovative use of because across the English speaking Twittersphere. In L. Squires (Ed.), English in computer-mediated communication. Variation, representation, and change (pp. 149–178). Berlin: Walter de Gruyter.
Google Scholar
Brazil, D. (1995). A grammar of speech. Oxford: OUP.
Google Scholar
Brinton, L. J. (1996). Pragmatic markers in english: Grammaticalization and discourse functions. Berlin: Walter de Gruyter.
Book Google Scholar
Burridge, K. (2014). Cos: A new discourse marker for australian english? Australian Journal of Linguistics, 34(4), 524–548.
Article Google Scholar
Carter, R. & McCarthy, M. (2017). Spoken grammar: Where are we and where we are going? Applied Linguistics, 38(1), 1–20.
Article Google Scholar
Čermák, F. (2002/2014). Obecná čeština: Je součástí české diglosie? In F. Čermák (2014), Jazyk a slovník. Vybrané lingvistické studie (pp. 477–485). Praha: Karolinum.
Claridge, C., & Walker, T. (2001). Causal clauses in written and speech-related genres in Early Modern English. ICAME Journal, 25, 31–63.
Google Scholar
Couper-Kuhlen, E. (1996). Intonation and clause combining in discourse: The case of because. Pragmatics, 6(3), 389–426.
Article Google Scholar
Couper-Kuhlen, E. (2011). When turns start with because: an excercise in interactional syntax. Studies in Variation, Contacts and Change in English, vol. 8. Available at: www.helsinki.fi/varieng/journal/volumes/08/couper-kuhlen.
Crystal, D. (1988). Another look at, well, you know…. English Today, 4(1), 47–49.
Article Google Scholar
Cvrček, V., Čermáková, A., & Křen, M. (2016). Nová koncepce synchronních korpusů psané češtiny. Slovo a slovesnost, 77(2), 83–101.
Google Scholar
Cvrček, V., et al. (2010). Mluvnice současné češtiny I. Praha: Karolínum.
Google Scholar
D’Arcy, A., & Tagliamonte, S. A. (2015). Not always variable: Probing the vernacular grammar. Language Variation and Change, 27, 255–285. doi:10.1017/S0954394515000101.
Article Google Scholar
Davison, A. (1975). Indirect speech acts and what to do with them. In P. Cole & J. L. Morgan (Eds.), Syntax and semantics (Vol. 3, pp. 142–185)., Speech acts New York: Academic Press.
Google Scholar
Debaisieux J.-M. (2013). Analyses Linguistique sur corpus. Subordination et insubordination en français. Paris: Hermès, Chap.
Ferguson, C. A. (1959). Diglossia. Word, 15(2), 325–340.
Article Google Scholar
Fischer, K. (2005). Towards an understanding of the spektrum of approaches to discourse particles: Introduction to the volume. In K. Fischer (Ed.), Approaches to discourse markers (pp. 1–20). London: Elsevier.
Google Scholar
Ford, C. E. (1994). Dialogic aspects of talk and writing: Because on the interactive-edited continuum. Text, 14(4), 513–554.
Article Google Scholar
Fraser, B. (1999). What are discourse markers? Journal of Pragmatics, 31(7), 931–952.
Article Google Scholar
Fraser, B. (2006). Towards a theory of discourse markers. Approaches to Discourse Particles, 1, 189–204.
Google Scholar
Gattercole, S. E., & Baddeley, A. D. (1993). Working memory and language. Hove: Lawrence Erlbaum.
Google Scholar
Gazdar, G. (1980). Pragmatics and logical form. Journal of Pragmatics, 4, 1–13.
Article Google Scholar
Grepl, M., & Karlík, P. (1998). Skladba češtiny. Olomouc: Votobia.
Google Scholar
Hoffmannová, J. (2012). Syntaktická stylistika mluvených projevů. In S. Čmejrková, J. Hoffmannová & J. Klímová (Eds), Čeština v pohledu synchronním a diachronním. Stoleté kořeny Ústavu pro jazyk český (pp. 707–713). Praha: UK, Karolinum.
Hoffmannová, J. & Zeman, J. (2017). Výzkum syntaxe mluvené češtiny: vstupní inventarizace problémů. Slovo a slovesnost.
Hošnová, E. (2005). Studie z vývoje novočeské syntaxe (Konjunkce, pronominalizace). Acta Universitatis Carolinae Philologica — Monographia CXLV. Praha: Karolinum.
Janda, L., & Townsend, C. (2000). Czech. Műnchen: Lincom Europa.
Google Scholar
Leech, G. (2000). Grammars of spoken english: New outcomes of corpus-oriented research. Language Learning, 50(4), 675–724.
Article Google Scholar
Lenker, U. (2010). Argument and rhetoric. Adverbial connectors in the history of english. Berlin: De Gruyter Mouton.
Book Google Scholar
Massot, B. (2008). Français et diglossie. Décrire la situation linguistique française contemporaine comme une diglossie: Arguments morphosyntaxiques. Doctoral dissertation, Université Paris VIII Vincennes-Saint Denis.
McCarthy, M. (1998). Spoken language and applied linguistics. Cambridge: Cambridge University Press.
Google Scholar
McTear, M. F. (1980). The pragmatics of because. Mimeo: Ulster Polytechnic.
Google Scholar
Miller, J., & Weinert, R. (1998). Spontaneous spoken language: Syntax and discourse. Oxford: Clarendon Press.
Google Scholar
Morreal, J. (1979). The evidential use of because. Papers in Linguistics, 12(1–2), 231–238.
Article Google Scholar
Müller, S. (2005). Discourse markers in native and non-native English discourse. Amsterdam: John Benjamins Publishing.
Book Google Scholar
Noordman, L., & Vonk, W. (1998). Memory-based processing in understanding causal information. Discourse Processes, 26, 191–212.
Article Google Scholar
Raso, T. (2014). Prosodic constraints for discourse markers. In T. Raso & H. Mello (Eds.), Spoken corpora and linguistic studies (pp. 411–467). Amsterdam: John Benjamins.
Chapter Google Scholar
Rissanen, M. (1997). Towards an integrated view of the development of English: Notes on causal linking. In J. Fisiak & M. Krygier (Eds.), Advances in english historical linguistics (pp. 389–406). Berlin: Mouton de Gruyter.
Google Scholar
Rowlett, P. A. (2011). Syntactic variation and diglossia in French. Salford Working Papers in Linguistics and Applied Linguistics, 1, 13–26.
Google Scholar
Sampson, G. (2013). The empirical trend: Ten years on. International Journal of Corpus Linguistics, 18(2), 281–289.
Article Google Scholar
Sanders, T. (2005). Coherence, causality and cognitive complexity in discourse. In M. Aurnague & M. Bras (Eds.) Proceedings of the first international symposium on the exploration and modelling of meaning (pp. 31–46). Toulouse: Université de Toulouse-le-Mirail. Available at: http://www.let.uu.nl/~Ted.Sanders/personal/uploads/pdf/Sanders%20(2005).pdf. Retrieved 15 Feb 2016.
Sanders, T., & Noordman, L. (2000). The role of coherence relations and their linguistic markers in text processing. Discourse Processes, 29(1), 37–60.
Article Google Scholar
Schleppegrell, M. J. (1991). Paratactic because. Journal of Pragmatics, 16, 323–337.
Article Google Scholar
Schleppegrell, M. (1992). Subordination and linguistic complexity. Discourse Processes, 15(1), 117–131.
Article Google Scholar
Sgall, P., Hronek, J., Stich, A., & Horecký, J. (1992). Variation in language: Code switching in Czech as a challenge for sociolinguistics. Amsterdam: John Benjamins.
Book Google Scholar
Shiffrin, D. (1987). Discourse markers. Cambridge: Cambridge University Press.
Book Google Scholar
Sinclair, J. M., & Mauranen, A. (2006). Linear unit grammar: Integrating speech and writing. Amsterdam: John Benjamins.
Book Google Scholar
Stenström, A.-B. (1998). From sentence to discourse: Cos (because) in teenage talk. In A. H. Jucker & Y. Ziv (Eds.), Discourse markers (pp. 127–146). Amsterdam: John Benjamins.
Chapter Google Scholar
Štícha, F., et al. (2013). Akademická gramatika spisovné češtiny. Praha: Academia.
Google Scholar
Stubbs, M. (1983). Discourse analysis. The sociolinguistic analysis of natural language. Oxford: Basil Blackwell.
Google Scholar
Svoboda, K. (1954). O takzvaných větách příčinných a účinkových. Naše řeč, 37(1–2), 1–14.
Google Scholar
Sweetser, E. (1990). From etymology to pragmatics. Cambridge: Cambridge University Press.
Book Google Scholar
Taboada, M. (2006). Discourse markers as signals (or not) of rhetorical relations. Journal of Pragmatics, 38(4), 567–592.
Article Google Scholar
Weinreich, U., Labov, W., & Herzog, M. (1968). Empirical foundations of a theory of language chase. In W. P. Lehmann & Y. Malkiel (Eds.), Directions for historical linguistics (pp. 95–188). Austin: University of Texas Press.
Google Scholar
Wilson, J. (2010). Moravians in Prague. A sociolinguistic study of dialect contact in the Czech Republic. Frankfurt am Main: Peter Lang.
Google Scholar
Zufferey, S. (2012). ‘Car, parce que, puisque’ revisited: Three empirical studies on French causal connectives. Journal of Pragmatics, 44(2), 138–153.
Article Google Scholar

Download references

Funding

This study is a result of the research funded by the Czech Science Foundation as the project GA ČR 15-01116S “Syntax of spoken Czech”.

Author information

Authors and Affiliations

Institute of the Czech National Corpus, Charles University, Prague, Czech Republic
Anna Čermáková, Zuzana Komrsková, Marie Kopřivová & Petra Poukarová

Authors

Anna Čermáková
View author publications
You can also search for this author in PubMed Google Scholar
Zuzana Komrsková
View author publications
You can also search for this author in PubMed Google Scholar
Marie Kopřivová
View author publications
You can also search for this author in PubMed Google Scholar
Petra Poukarová
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Anna Čermáková.

Ethics declarations

Conflict of interest

On behalf of all authors, the corresponding author states that there is no conflict of interest.

Additional information

We would like to thank the editors and two anonymous reviewers of this paper for their suggestions and comments.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Čermáková, A., Komrsková, Z., Kopřivová, M. et al. Between Syntax and Pragmatics: The Causal Conjunction Protože in Spoken and Written Czech. Corpus Pragmatics 1, 393–414 (2017). https://doi.org/10.1007/s41701-017-0014-y

Download citation

Received: 28 November 2016
Accepted: 14 April 2017
Published: 25 April 2017
Issue Date: December 2017
DOI: https://doi.org/10.1007/s41701-017-0014-y

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Between Syntax and Pragmatics: The Causal Conjunction Protože in Spoken and Written Czech

Abstract

Access this article

Similar content being viewed by others

Aspects of Anaphora in Chinese and in Some Germanic, Romance, and Slavic Languages, the ‘Syntactic’ Versus ‘Pragmatic’ Language Typology, and Neo-Gricean Pragmatics

Conceptual and Procedural Aspects of Causal Meaning: Corpus-Analytic Evidence from Modern Greek

Discourse Connectives: Theoretical Models and Empirical Validations in Humans and Computers

Notes

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Between Syntax and Pragmatics: The Causal Conjunction Protože in Spoken and Written Czech

Abstract

Access this article

Similar content being viewed by others

Aspects of Anaphora in Chinese and in Some Germanic, Romance, and Slavic Languages, the ‘Syntactic’ Versus ‘Pragmatic’ Language Typology, and Neo-Gricean Pragmatics

Conceptual and Procedural Aspects of Causal Meaning: Corpus-Analytic Evidence from Modern Greek

Discourse Connectives: Theoretical Models and Empirical Validations in Humans and Computers

Notes

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation