Abstract
Research into causal conjunctions suggests that there are various degrees of causality and that causality is better situated on a cline between strong and weak. Some studies of English because/’cause/cos suggest a diachronic change in the spoken language, where the use of because is shifting from prototypical subordinator to discourse marker (Stenström, in: Jucker, Ziv (eds) Discourse markers, John Benjamins, Amsterdam, 1998; Burridge in Aust J Linguist 34(4):524–548, 2014). This study examines in detail the use of the most frequent Czech causal conjunction protože in both written and spoken language, thus making a further contribution to cross-linguistic research into causality and to research into the differences between spoken and written language more generally. There are two major language varieties of Czech: the common vernacular and the standard literary language (the codified norm). These two varieties differ in a number of respects—at the morphological, lexical and phonological levels. In comparing spoken and written Czech, very few studies include syntactic features and none are based on large-scale authentic spoken data. Based on the corpus data, the conjunction protože occurs strikingly more frequently in spoken Czech than in written language. This study looks at some differences in its distribution. The study is based on extensive corpus data of both written Czech (comprising fiction, newspapers and academic texts) and spoken Czech (corpora of spontaneous conversations and TV debates).
Similar content being viewed by others
Notes
We assume that there are features that are e.g. typical for spoken English (as opposed to written English) and we may not be able to establish cross-linguistic parallels, while there are features typical for spoken English, where we may be able to claim a degree of cross-linguistic universality.
Page numbers given for Sanders (2005) refer to the online copy of the article available at: http://www.let.uu.nl/~Ted.Sanders/personal/uploads/pdf/Sanders%20(2005).pdf.
Czech belongs to the Slavic language family and is a highly inflected language with a complex morphological system. Historical events connected to the Czech struggle for independence have shaped the prevalent attitudes to language correctness and have given space to puristic tendencies. Czech is stratified geographically, stylistically and socially, with perhaps the most notable difference drawn between StC and CC (as well as between StC, which is spoken in Bohemia, and the distinct Moravian dialects). However, in actual language use there are no clear-cut boundaries between the two varieties and speakers oscillate between them depending on the situation, and especially with regard to the degree of formality of the given situation.
The H variety corresponds to StC and the L variety to CC.
Recently, for example, Massot (2008) and Rowlett (2011) have argued that French is a diglossic situation. Rowlett (2011: 13) notes, it has been commonplace with French to “claim there had been no significant syntactic change since the end of the 17th century”. With the development of spoken corpora interest in syntactic variation and specifically variation in spoken French has risen (Rowlett 2011: 3) and a potential alternative account for synchronic syntactic variation in French is offered by e.g. Massot (2008). Massot suggests the diglossic approach primarily in response to the extensive research into spoken French (e.g. Blanche-Benveniste 1997, 2003, 2004) and he considers the distinctions between “spoken” and “written” insufficient to explain syntactic variation. He explains the variation in terms of two distinct internal grammars of French (that of the vernacular and that of the more prestigious variety acquired later — note here the parallel with Czech) that speakers have access to, while maintaining that this distinction is not the same as the distinction between spoken and written language (Rowlett 2011). However, all languages with evolved literary tradition to some degree could be considered from this point of view. Learning to write a language always requires adjustments of the spoken variety.
Cf. also Linear Unit Grammar: Integrating Speech and Writing (Sinclair and Mauranen 2006).
Examples with letters are examples from literature review and examples with consecutive numbering are authentic examples from our data (in brackets is the corpus/subcorpus from which the example is taken).
And these certainly have cross-linguistic validity. See also Sinclair and Mauranen (2006).
For French, see e.g. Debaisieux (2013).
For an interesting account of the use of because in the web communications (e.g. Twitter) see Bohman (2016).
Schiffrin analyzes the following as DMs: and, because, but, I mean, now, oh, or, so, then, well, y‘know.
Fraser (2006) uses the “cover term” pragmatic markers (expressions that “occur as part of a discourse segment but are not part of the propositional content of the message”) and distinguishes between four types of pragmatic markers. DMs (that “signal a relation between the discourse segment which hosts them, and the prior discourse segment”) are one of them. Fraser claims that there are over 100 DMs in English, which can be divided into four basic semantic relationships (contrastive, elaborative, inferential, and temporal markers). Aijmer seems to prefer the term pragmatic marker as well (Aijmer 2013). For further discussion of terminology and theoretical approaches to DMs, see e.g. Fischer (2005), for contrastive studies of pragmatic markers see Aijmer and Simon-Vandenbergen (2006).
Crystal (1988: 48) talks about these pragmatic expressions as “the oil which helps us perform the complex task of spontaneous speech”.
But, as further research shows, this is not the case with all discourse markers (e.g. Brinton 1996).
Fraser (2006) mentions that “[o]ne area which requires more investigation is the extent to which there is polysemy throughout the class of DMs”.
For diachronic distribution of the two main Czech causal conjunctions protože and neboť, see http://syd.korpus.cz/ipP7hWCE.dia.
For an overview of the frequency distribution between protože and neboť in spoken and written language, see http://syd.korpus.cz/ipP7hWCE.syn.
The corpora were chosen in analogy to LGSWE (Biber et al. 1999) but in the case of the spoken language we opted for two varieties.
Data are freely available at www.korpus.cz.
More detailed information is available at http://wiki.korpus.cz/doku.php/en:cnk:oral2013.
The lemmatised spoken corpus will be made available later in 2017.
Capitalisation indicate lemma, i.e. overall frequency of all forms including reduced variants.
Schiffrin (1987), however, maintains that one of the important features of a DM is its simultaneous anaphoric and cataphoric function.
For the spoken data we used a pilot version of automatic annotation, which does not reach the accuracy of the annotation of the written data.
Using X 2, at p < .001.
Small town in Moravia.
Šopáč is probably a place name.
Ne means ‘no’ but can be used colloquially without having the negative meaning, see example 12.
One of the anonymous reviewers pointed to the fact that it is probably quite common at this point, especially if our online processing has not been completed in terms of syntax. While we are saying the first clause, we may be weighing up the listener’s reaction to what we are saying and reconfiguring it.
References
Adolphs, S., & Carter, R. (2013). Spoken corpus linguistics: From monomodal to multimodal. London: Routledge.
Aijmer, K. (2002). English discourse particles: Evidence from a corpus. Amsterdam: John Benjamins.
Aijmer, K. (2013). Understanding pragmatic markers. A variational pragmatic approach. Edinburgh: Edinburgh University Press.
Aijmer, K., & Simon-Vandenbergen, A.-M. (2006). Pragmatic markers in contrast. London: Elsevier.
Altenberg, B. (1984). Causal linking in spoken and written English. Studia Linguistica, 38, 20–69.
Bermel, N. (2000). Register variation and language standards in Czech. LINCOM Europa, Studies in Slavic Linguistics 13. Muenchen.
Bermel, N. (2014). Czech diglossia: Dismantling or dissolution? In J. Árokay, J. Gvozdanović, & D. Miyajima (Eds.), Divided languages? (pp. 21–37). Berlin: Springer.
Biber, D. (1991). Variation across speech and writing. Cambridge: Cambridge University Press.
Biber, D., Johansson, S., Leech, G., Conrad, S., & Finegan, E. (1999). Longman grammar of spoken and written english. Pearson Education Ltd: Harlow.
Blanche-Benveniste, C. (1997). Approhes de la langue parlée en français. L’Essentiel. Paris: Ophrys.
Blanche-Benveniste, C. (2003). La langue parlée. In M. Yaguello (Ed.), Le grand livre de la langue française (pp. 317–344). Paris: Seuil.
Blanche-Benveniste, C. (2004). Le singulier et le pluriel en français parlé contemporain. Bulletin de la Société de linguistique de Paris, tome XCIX, pp. 129–154.
Bohman, A. (2016). Language change because Twitter? Factors motivating innovative use of because across the English speaking Twittersphere. In L. Squires (Ed.), English in computer-mediated communication. Variation, representation, and change (pp. 149–178). Berlin: Walter de Gruyter.
Brazil, D. (1995). A grammar of speech. Oxford: OUP.
Brinton, L. J. (1996). Pragmatic markers in english: Grammaticalization and discourse functions. Berlin: Walter de Gruyter.
Burridge, K. (2014). Cos: A new discourse marker for australian english? Australian Journal of Linguistics, 34(4), 524–548.
Carter, R. & McCarthy, M. (2017). Spoken grammar: Where are we and where we are going? Applied Linguistics, 38(1), 1–20.
Čermák, F. (2002/2014). Obecná čeština: Je součástí české diglosie? In F. Čermák (2014), Jazyk a slovník. Vybrané lingvistické studie (pp. 477–485). Praha: Karolinum.
Claridge, C., & Walker, T. (2001). Causal clauses in written and speech-related genres in Early Modern English. ICAME Journal, 25, 31–63.
Couper-Kuhlen, E. (1996). Intonation and clause combining in discourse: The case of because. Pragmatics, 6(3), 389–426.
Couper-Kuhlen, E. (2011). When turns start with because: an excercise in interactional syntax. Studies in Variation, Contacts and Change in English, vol. 8. Available at: www.helsinki.fi/varieng/journal/volumes/08/couper-kuhlen.
Crystal, D. (1988). Another look at, well, you know…. English Today, 4(1), 47–49.
Cvrček, V., Čermáková, A., & Křen, M. (2016). Nová koncepce synchronních korpusů psané češtiny. Slovo a slovesnost, 77(2), 83–101.
Cvrček, V., et al. (2010). Mluvnice současné češtiny I. Praha: Karolínum.
D’Arcy, A., & Tagliamonte, S. A. (2015). Not always variable: Probing the vernacular grammar. Language Variation and Change, 27, 255–285. doi:10.1017/S0954394515000101.
Davison, A. (1975). Indirect speech acts and what to do with them. In P. Cole & J. L. Morgan (Eds.), Syntax and semantics (Vol. 3, pp. 142–185)., Speech acts New York: Academic Press.
Debaisieux J.-M. (2013). Analyses Linguistique sur corpus. Subordination et insubordination en français. Paris: Hermès, Chap.
Ferguson, C. A. (1959). Diglossia. Word, 15(2), 325–340.
Fischer, K. (2005). Towards an understanding of the spektrum of approaches to discourse particles: Introduction to the volume. In K. Fischer (Ed.), Approaches to discourse markers (pp. 1–20). London: Elsevier.
Ford, C. E. (1994). Dialogic aspects of talk and writing: Because on the interactive-edited continuum. Text, 14(4), 513–554.
Fraser, B. (1999). What are discourse markers? Journal of Pragmatics, 31(7), 931–952.
Fraser, B. (2006). Towards a theory of discourse markers. Approaches to Discourse Particles, 1, 189–204.
Gattercole, S. E., & Baddeley, A. D. (1993). Working memory and language. Hove: Lawrence Erlbaum.
Gazdar, G. (1980). Pragmatics and logical form. Journal of Pragmatics, 4, 1–13.
Grepl, M., & Karlík, P. (1998). Skladba češtiny. Olomouc: Votobia.
Hoffmannová, J. (2012). Syntaktická stylistika mluvených projevů. In S. Čmejrková, J. Hoffmannová & J. Klímová (Eds), Čeština v pohledu synchronním a diachronním. Stoleté kořeny Ústavu pro jazyk český (pp. 707–713). Praha: UK, Karolinum.
Hoffmannová, J. & Zeman, J. (2017). Výzkum syntaxe mluvené češtiny: vstupní inventarizace problémů. Slovo a slovesnost.
Hošnová, E. (2005). Studie z vývoje novočeské syntaxe (Konjunkce, pronominalizace). Acta Universitatis Carolinae Philologica — Monographia CXLV. Praha: Karolinum.
Janda, L., & Townsend, C. (2000). Czech. Műnchen: Lincom Europa.
Leech, G. (2000). Grammars of spoken english: New outcomes of corpus-oriented research. Language Learning, 50(4), 675–724.
Lenker, U. (2010). Argument and rhetoric. Adverbial connectors in the history of english. Berlin: De Gruyter Mouton.
Massot, B. (2008). Français et diglossie. Décrire la situation linguistique française contemporaine comme une diglossie: Arguments morphosyntaxiques. Doctoral dissertation, Université Paris VIII Vincennes-Saint Denis.
McCarthy, M. (1998). Spoken language and applied linguistics. Cambridge: Cambridge University Press.
McTear, M. F. (1980). The pragmatics of because. Mimeo: Ulster Polytechnic.
Miller, J., & Weinert, R. (1998). Spontaneous spoken language: Syntax and discourse. Oxford: Clarendon Press.
Morreal, J. (1979). The evidential use of because. Papers in Linguistics, 12(1–2), 231–238.
Müller, S. (2005). Discourse markers in native and non-native English discourse. Amsterdam: John Benjamins Publishing.
Noordman, L., & Vonk, W. (1998). Memory-based processing in understanding causal information. Discourse Processes, 26, 191–212.
Raso, T. (2014). Prosodic constraints for discourse markers. In T. Raso & H. Mello (Eds.), Spoken corpora and linguistic studies (pp. 411–467). Amsterdam: John Benjamins.
Rissanen, M. (1997). Towards an integrated view of the development of English: Notes on causal linking. In J. Fisiak & M. Krygier (Eds.), Advances in english historical linguistics (pp. 389–406). Berlin: Mouton de Gruyter.
Rowlett, P. A. (2011). Syntactic variation and diglossia in French. Salford Working Papers in Linguistics and Applied Linguistics, 1, 13–26.
Sampson, G. (2013). The empirical trend: Ten years on. International Journal of Corpus Linguistics, 18(2), 281–289.
Sanders, T. (2005). Coherence, causality and cognitive complexity in discourse. In M. Aurnague & M. Bras (Eds.) Proceedings of the first international symposium on the exploration and modelling of meaning (pp. 31–46). Toulouse: Université de Toulouse-le-Mirail. Available at: http://www.let.uu.nl/~Ted.Sanders/personal/uploads/pdf/Sanders%20(2005).pdf. Retrieved 15 Feb 2016.
Sanders, T., & Noordman, L. (2000). The role of coherence relations and their linguistic markers in text processing. Discourse Processes, 29(1), 37–60.
Schleppegrell, M. J. (1991). Paratactic because. Journal of Pragmatics, 16, 323–337.
Schleppegrell, M. (1992). Subordination and linguistic complexity. Discourse Processes, 15(1), 117–131.
Sgall, P., Hronek, J., Stich, A., & Horecký, J. (1992). Variation in language: Code switching in Czech as a challenge for sociolinguistics. Amsterdam: John Benjamins.
Shiffrin, D. (1987). Discourse markers. Cambridge: Cambridge University Press.
Sinclair, J. M., & Mauranen, A. (2006). Linear unit grammar: Integrating speech and writing. Amsterdam: John Benjamins.
Stenström, A.-B. (1998). From sentence to discourse: Cos (because) in teenage talk. In A. H. Jucker & Y. Ziv (Eds.), Discourse markers (pp. 127–146). Amsterdam: John Benjamins.
Štícha, F., et al. (2013). Akademická gramatika spisovné češtiny. Praha: Academia.
Stubbs, M. (1983). Discourse analysis. The sociolinguistic analysis of natural language. Oxford: Basil Blackwell.
Svoboda, K. (1954). O takzvaných větách příčinných a účinkových. Naše řeč, 37(1–2), 1–14.
Sweetser, E. (1990). From etymology to pragmatics. Cambridge: Cambridge University Press.
Taboada, M. (2006). Discourse markers as signals (or not) of rhetorical relations. Journal of Pragmatics, 38(4), 567–592.
Weinreich, U., Labov, W., & Herzog, M. (1968). Empirical foundations of a theory of language chase. In W. P. Lehmann & Y. Malkiel (Eds.), Directions for historical linguistics (pp. 95–188). Austin: University of Texas Press.
Wilson, J. (2010). Moravians in Prague. A sociolinguistic study of dialect contact in the Czech Republic. Frankfurt am Main: Peter Lang.
Zufferey, S. (2012). ‘Car, parce que, puisque’ revisited: Three empirical studies on French causal connectives. Journal of Pragmatics, 44(2), 138–153.
Funding
This study is a result of the research funded by the Czech Science Foundation as the project GA ČR 15-01116S “Syntax of spoken Czech”.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
On behalf of all authors, the corresponding author states that there is no conflict of interest.
Additional information
We would like to thank the editors and two anonymous reviewers of this paper for their suggestions and comments.
Rights and permissions
About this article
Cite this article
Čermáková, A., Komrsková, Z., Kopřivová, M. et al. Between Syntax and Pragmatics: The Causal Conjunction Protože in Spoken and Written Czech. Corpus Pragmatics 1, 393–414 (2017). https://doi.org/10.1007/s41701-017-0014-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s41701-017-0014-y